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MULT I VALENT AND MULTIS PECIFIC ANTIGEN- BINDING PROTEIN 

FIELD OF THE INVENTION 

5 The present invention relates to multivalent and 

multispecif ic antigen binding proteins, methods for their 
production and uses thereof. In particular, the 
invention relates to binding proteins comprising 
polypeptides which associate to form multivalent or 
10 multispecif ic multimers . 

BACKGROUND OF THE INVENTION 

Antibodies are protein molecules having a structure based 

15 on a unit comprising four polypeptides, two identical 

heavy chains and two identical light chains, which are 
covalently linked together by disulphide bonds. Each of 
these chains is folded in discrete domains. The C- 
terminal regions of both heavy and light chains are 

20 conserved in sequence and are called the constant 

regions, comprising one or more so-called C-domains. The 
N- terminal regions of the heavy and light chains, also 
known as V-domains, are variable in sequence and 
determine the specificity of the antibody. The regions 

25 in the variable domains of the light and heavy chains (V L 

and V H respectively) responsible for antigen binding 
activity are known as the hypervariable or 
complementarity determining regions (CDR) . Natural 
antibodies have at least two identical antigen-binding 

30 sites defined by the association of the heavy and light 

chain variable regions. 

It is known that proteolytic digestion of an antibody can 
lead to the production of antibody fragments . Such 
35 fragments, or portions, of the whole antibody can exhibit 

antigen binding activity. An example of a binding 
fragment is an F ab fragment which comprises a light chain 
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associated with the V H and C H1 domains of a heavy chain. 
The bivalent F(ab L ) 2 fragment comprises two such F ab 
fragments connected together via the hinge region, giving 
two antigen binding sites. F v fragments, consisting only 
5 of the V-domains of the heavy and light chains associated 

with each other may also be obtained. These F v fragments 
are monovalent for antigen binding. Smaller fragments 
such as individual V-domains (domain antibodies or dABs, 
Ward et al Nature, 341 . 544 (1989) and individual CDR's 
10 (Williams et al, Proc . Natl. Acad. Sci, USA, JB_6, 553 7 

(1989)) have also been shown to retain the binding 
characteristics of the parent antibody although generally 
most naturally occurring antibodies need both a V H and V L 
to retain full immunoreactivity . 

15 

Antibody fragments comprising V H and V L domains associated 
together to have antigen binding activity have also been 
described. The single chain F v fragment (scFv) comprises 
a V H domain linked to a V L domain by a flexible 
20 polypeptide linker such that the domains can associate to 
form an antigen binding site (see, for example, EP-B- 
02 81604, Enzon Labs Inc) . 

Microbial expression systems for producing active 
25 antibody fragments are known in the literature. The 

production of Fab in various hosts such as E.coli. 
(Better et al, Science, 240 , 104, (1988)), yeast (Horwitz 
et al, Proc. Natl. Acad. Sci, US4 , 85, 8678 (1988)) and 
the filamentous fungus Trichoderma reesei (Nyyssonen et 
30 al, Bio/Technology, 11 . 591 (1993)) have previously been 

described, for example. It is also known that plants can 
be used as hosts for the production of SCFv fragments 
(Owen et al, Bio/Technology, 10, 790 (1992)) as well as 
whole antibodies. 



An advantage of using antibody fragments rather than 
whole antibodies in diagnosis and therapy lies in their 
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smaller size. They are likely to be less immunogenic 
than whole antibodies and more able to penetrate tissue. 
A disadvantage associated with the use of fragments such 
as the F ab , F v( and S C F V antibody fragments described above, 
5 however is that they have only one binding site for 
antigen binding as compared to the two or more sites 
contained in the whole antibody, preventing polyvalent 
binding to the antigen and hence leading to reduced 
avidity. 

10 

In an attempt to overcome this problem, attention has 
been directed to providing multivalent antigen binding 
proteins, that is binding proteins having more than one 
antigen binding site. In addition, there has been 

15 interest in producing antigen-binding proteins having 

multiple specificities capable of binding to different 
antigenic determinants and containing antigen binding 
domains derived from different sources. Antigen-binding 
proteins having distinct binding specificities may be 

20 useful, for example, in targeting effector cells to 

target cells by virtue of the specific binding of the 
different binding domains. By way of illustration, a 
bispecific antigen binding protein having specificity for 
both tumour cells and cytotoxic drugs may be used to 

25 target specifically cytotoxic drug to tumour cell in an 

efficient manner. By avoiding the need for chemical 
modification, adverse immune responses may be avoided. 

Hitherto, the potential application of multivalent and 
30 multispecif ic antigen binding proteins have been hindered 

by the difficulties in generating and purifying such 
molecules . 

Recombinant antigen-binding proteins having two binding 
35 sites may be prepared by methods such as chemical cross- 

linking of cysteine residues, either through cysteine 
residues introduced at the C- terminus of the V H of an F v 
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(Cumber et al, J. Immunol., 149 , 120 (1992)), through the 
hinge cysteine residues in F ab to generate (Fab 1 ) 2 (Carter 
et al, Bio/Tech., 10, 163 (1992)) or at the C-terminus of 
the V L of an scFv (Pack and Pluckthun, Biochemistry, 21, 
5 1579 (1992)). Alternatively, the production of bivalent 

and bispecific antibody fragments based on the inclusion 
of F ab fragments of C-terminal peptide sequences which 
promote dimerisation has been described. (Kostelny et 
al, J. Immunol., 148, 1547). 

10 

Bivalent or bispecific antibody fragments comprising a 
binding complex containing two polypeptide chains, one 
comprising two heavy chain variable domains (V H ) in series 
and the other comprising two light chain variable domains 
15 (V L ) in series are described in our pending European 

Patent Application No. 95307332.7. 

Multivalent and/or multispecif ic antibody fragments are 
described in WO 94/09131 (Scotgen Limited) . Specific 

2 0 binding proteins having two binding regions, contained at 
least in part on first and second polypeptide chains 
which chains additionally incorporate associating domains 
capable of binding to each other causing the polypeptide 
chains to combine are disclosed therein. It is disclosed 

25 that the first and second binding regions preferably are 

antibody antigen -binding domains, for example comprising 
V H and V L regions contained in a Fab fragment or in a 
single-chain Fv fragment, or may be derived from just one 
of the V H or V L regions of an antibody. The associating 

30 domains may suitably be derived from an antibody and may 

be inter alia antibody V H and V L regions. It is further 
disclosed that using a V H /V L domain combination to achieve 
association leads to the creation of a supplementary Fv 
domain such that the antibody produced may be trivalent . 

35 Schematic representations of the arrangements suggested 
in WO 94/09131 to produce trivalent fragments are shown 
in Figure 1A. 
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WO 93/11161 (Enzon Inc) describes multivalent antigen- 
binding proteins comprising two or more single-chain 
protein molecules, each single chain molecule comprising 
first and second polypeptides each comprising the binding 
5 portion of the variable region of an antibody heavy or 
light chain with the polypeptides being linked together 
via a peptide linker. Hypothetical trimers and tetramers 
are discussed, comprising three or four single-chain 
antigen binding proteins as appropriate. Schematic 
10 representations of the trivalent arrangements suggested 

are shown in Figure IB. 

WO 91/19739 (Celltech Limited) discloses multivalent 
antigen binding proteins comprising an Fv fragment bound 

15 to at least one further Fv fragment by a connecting 

structure which links the Fv fragments together but which 
maintains them spaced apart such that they can bind to 
adjacent antigenic determinants. Conveniently the 
connecting structure consists of a spacing polypeptide 

20 and a linkage unit such as a cross-linking maleimide 

linker or a molecule which allows for non-covalent 
binding. Particularly preferred connecting structures 
which are disclosed are based on antibody joining and 
hinge region sequences . 

25 

SUMMARY OF THE INVENTION 

According to the present invention there is provided a 
multivalent antigen binding protein comprising: 

30 

a first polypeptide comprising in series, three or 
more variable domains of an antibody heavy chain; 
and 

35 a second polypeptide comprising, in series, three or 

more variable domains of an antibody light chain, 
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said first and second polypeptides being linked by- 
association of the respective heavy chain and light chain 
variable domains, each associated variable domain pair 
forming an antigen binding site. 

5 

As used herein, the term multivalent means more than one 
antigen binding site. 

Preferably the first polypeptide comprises three variable 
10 domains of an antibody heavy chain and the second 

polypeptide comprises three variable domains of an 
antibody light chain, providing a trivalent protein. 

It will be appreciated that the polypeptides may comprise 
15 heavy or light chains, variable domains, as appropriate, 

or functional equivalents thereof. 

The respective heavy or light chain variable domains may 
suitably be linked without any intervening linker. 

20 According to a preferred embodiment, however, the 

variable domains contained in the individual polypeptides 
are linked by peptide linkers. Preferably the peptide 
linker is flexible, allowing the variable domains to flex 
in relation to each other such that they can bind to 

25 multiple antigenic determinants simultaneously. It will 
be appreciated that the binding of the linker to the 
individual heavy or light chain variable domains will be 
such that it does not affect the binding capacity of the 
binding site formed by the associated variable domain 

30 pair. Conveniently the peptide linker comprises from 16 

to 19 amino acid residues. A preferred, peptide linker 
for heavy chain domains is (Gly 4 Ser) ,AlaGlySerAla and for 
the light chain domains is (Gly 4 Ser) 3 Val . 

35 It will be appreciated that if two or more of the 

associated variable domain pairs (V H /V L pairs) have the 
same antigen specificity, for example if they are derived 
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from the same parent antibody or fragment thereof or from 
different antibodies which bind the same epitope, then a 
binding protein which binds more than one molecule of the 
same type will be produced. 

According to one embodiment, where the binding protein 
according to the invention comprises three antigen 
binding sites which are able to bind different epitopes 
from each other, a trivalent trispecific protein is 
produced . 

In another embodiment, where the binding protein 
according to the invention comprises three associated 
variable domain pair binding sites, two of which sites 
bind the same epitopes, a trivalent, bispecific protein 
is provided. Where all three binding sites have the same 
antigen specificity, a trivalent, monospecific binding 
protein is provided. 

The invention also provides nucleotide sequences coding 
for the polypeptides of the multivalent antigen binding 
protein according to the invention and cloning and 
expression vectors containing such nucleotide sequences. 

The invention further provides host cells transformed 
with vectors containing such nucleotide sequences and 
methods of producing such polypeptides by expression of 
the nucleotide sequences in such hosts . 

The invention further provides a process for preparing a 
multivalent antigen binding protein as set forth above 
comprising: 

(i) transforming one or more hosts by incorporating 
genes encoding said first and second 
polypeptides ,- 

(ii) expressing said genes in said host or hosts,- 
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(iii) allowing said first and second polypeptides to 
combine to form the antigen binding protein. 

Suitably the host or hosts may be selected from 
5 prokaryotic bacteria, such as Gram-negative bacteria, for 
example E.Coli, and Gram-positive bacteria, for example 
B . subtilis or lactic acid bacteria, lower eukaryotes 
such as yeasts, for example belonging to the genera 
Saccharomyces Kluyveromyces or Tri cho derma , moulds such 

10 as those belonging to the genera Aspergillus and 

Neurospora and higher eukaroytes, such as plants, for 
example tobacco, and animal cells, examples of which are 
myeloma cells and CHO, COS cells and insect cells. A 
particularly preferred host for use in connection with 

15 the present invention is COS (monkey kidney) cells. 

Techniques for synthesising genes, incorporating them 
into hosts and expressing genes in hosts are well known 
in the art and the skilled person would readily be able 
to put the invention into effect using common general 
knowledge. Proteins according to the invention may be 
recovered and purified using conventional techniques such 
as affinity chromatography, ion exchange chromatography 
or gel filtration chromatography. 

The activity of the multivalent binding proteins 
according to the invention may conveniently be measured 
by standard techniques known in the art such as enzyme- 
linked immunosorbant assay (ELISA) , radioimmune assay 
(RIA) or by using biosensors. 

The multivalent antigen binding proteins of the present 
invention may suitably be used in diagnostics or therapy 
for example in targeting a tumour cell with natural 
35 killer cells and cytotoxic agent. Other uses for which 

the multivalent binding proteins according to the 
invention are useful include those uses for which 



25 
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antibodies or fragments thereof are commonly used, 
including for immunoassays and in purification. 
According to a particular preferred embodiment, multi- 
enzyrae complexes may be assembled, at a target, for 
example a cell surface. As an illustration, multivalent 
binding proteins according to the invention may be used 
to target cell killing enzymes such as an oxidase (for 
example glucose oxidase) and peroxidase (for example 
horseradish peroxidase) to a target species which is an 
antigenic component of dental plaque, such as S. sanguis 
or S. mutans. Complexes comprising enzyme, coenzyme and 
target antigen may also conveniently be assembled. 

Accordingly, the invention also provides compositions 
comprising the multivalent antigen binding proteins 
according to the invention, conveniently in combination 
with a cosmetically or pharmaceutically acceptable 
carrier, diluent or excipient . Methods of treatment 
using the multivalent antigen binding proteins according 
to the invention are also provided. 

For use in diagnosis or therapy, the multivalent antigen 
binding proteins according to the invention may 
conveniently be attached to an appropriate diagnostically 
or therapeutically effective agent or carrier by methods 
conventional in the art. 

An advantage of using multivalents antigen binding 
proteins according to the invention over multivalent 
binding proteins prepared by existing techniques known in 
the art is that the " self -assembling" association of the 
respective heavy and light chain variable domains to form 
the multivalent binding sites avoids the need for 
chemical coupling steps or the introduction of linking 
residues to stabilise the multivalent constructs, thereby 
minimising the risk of eliciting an immune response to 
such molecules when the resulting multivalent binding 
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proteins are used in therapy. 

A particular advantage of molecules according to the 
present invention is that they may conveniently be 
5 purified straight from the supernatant using conventional 

purification techniques. As they are self -assembling, 
there is no need to purify individual subunits prior to 
coupling as in existing techniques. 

10 The present invention may be more fully understood with 
reference to the following description, when read 
together with the accompanying drawings . 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A and IB show schematic representations of 
published arrangements of heavy and light chain V-domain 
gene fragments that have been suggested to produce 
trispecific or trivalent antibody fragments: 

A) scFvl-VLa + scFv2-VHa (2 chains) WO 94/09131 

B) Fabl-Vla + Fab2-VHa (4 chains) WO 94/09131 

C) scFvl-VLa-CLa + scFvl-VHa-CHa (2 chains) WO 94/09131 

D) Fabl-VLa-CLa + Fab2-VHa-CHa (4 chains) WO 94/09131 

E) scFvl + scFv2 + scFv3 (3 chains) WO 93/11161 

F) VH1-VL2 + VH2-VL3 + VH3-VH1 (3 chains) WO 93/11161 

Figure 2A/B shows the nucleotide sequence of the EcoRl- 
Hindlll insert of pGOSA.E2t containing DNA encoding pelB 
30 leader- VH4715 -linker- VL3418 and DNA encoding pelB leader- 

VL3418-linker-VH4715-hydrophil2 tag (SEQ ID No. 1) . 

Figure 3 

35 a) shows the nucleotide sequence of the Hindlll-EcoRI 

insert of plasmid scFv.Lys with DNA encoding pelB leader- 
VHLys-1 inker -VLLys (SEQ ID No. 2) . 



20 
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B) shows the nucleotide sequence of the Hindlll-EcoRI 
insert of plasmid scFv.4715.2t with DNA encoding pelB 
leader-VH4715.2t (SEQ ID No. 3). 

5 Figure 4 shows the nucleotide sequence of the genomic 

leader sequence of the anti-NP antibody (Jones et al, 
Nature, 321 . 522) . Exon sequences are indicated with 
shaded boxes. Ncol and PstI restriction sites are in 
bold and underlined (SEQ ID No. 4) . 

10 

Figure 5 gives a schematic representation of the 
eukaryotic expression vector pSV.51. 

Figure 6 gives an overview of the pUC19 double head (A) 
15 and triple head (B) constructs. The position of the 

oligonucleotides and the restriction sites used for 
assembling double and triplehead pUC constructs are 
indicated. 

2 0 Figure 7 

A) shows the origin of the VH-C-linker and VL-C-linker 
fragments . 

25 B) gives a schematic representation of the construction 

of the pUC. 19 -triple-head vectors. 

Figure 8 

30 A) gives a schematic representation of the construction 

of the Euka.VH and Euka.VL vectors. 

B) gives a schematic representation of the construction 
of the pSV.VH expression vectors. 

35 

C) gives a schematic representation of the construction 
of the pSV.VL expression vectors. 
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Figure 9 shows the expression of the trispecific Golysan 
proteins on an SDS-PAGE gel containing total COS culture 
supernatant. Crude supernatants of COS cells transfected 
with pSV expression vectors were separated on SDS-PAGE 
5 gels. The proteins were transferred onto a 

nitrocellulose membrane and the VH3 and VL3-2t were 
detected using anti-VH and anti-hydrophil 2 tag specific 
monoclonal antibodies respectively. (A=anti-Hydro-II , 
B=anti-Hydro-II + anti-VH) Samples: M) Low Molecular 
10 Weight Markers, 1) pSV.K + pSV.V,2) pSV.K + pSV.W,3) 

pSV.M + pSV.V,4) pSV.M + pSV.W. 

Figure 10 shows the results of three ELISA's. Lysozyme, 
Glucose oxidase and S. sanguis binding activity was 
15 determined in crude COS supernatants by measuring 1) 

Lysozyme -Glucose oxidase (=LYS0X) , 2) Glucose oxidase- 
S. sanguis (=G0SA) and 3) Lysozyme- S . sanguis (= LYSAN) 
bispecific binding activities. 

20 Figure 11 shows the results of three ELISA's. Lysozyme, 
Glucose oxidase and S. sanguis binding activity of 
purified Golysan. A (A) and Golysan. B (B) was determined 
by measuring 1) Lysozyme-Glucose oxidase (=LYSOX) , 2) 
Glucose oxidase-S . sanguis (=G0SA) and 3) Lysozyme- 

25 S. sanguis (=LYSAN) bispecific binding activities. 

Figure 12 shows the nucleotide sequence of the EcoRI- 
HindlH insert of pUR.4124 containing DNA (see SEQ ID NO: 
23) encoding V L Lys - Linker- V H Lys . 

30 

Figure 13 shows the nucleotide sequence of the Hindi II- 
EcoRI insert of plasmid Fv.3418 (see SEQ ID NO: 24) 
containing DNA encoding pelB leader-V H 3418 and pelB 
leader-V L 3418 . 



Figure 14 shows the nucleotide sequence of the Hindi II- 
EcoRI insert of plasmid Fv.4715-myc (see SEQ ID NO: 25) 
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containing DNA encoding pelB leader-V H 4715 and pelB 
leader-V L 4715-Myc tag. 

Figure 15 shows the nucleotide sequence of the Hindi II- 
5 EcoRI insert of scFv. 4715-myc containing DNA (see SEQ ID 

NO: 26) encoding pelB leader-V H 4715-Linker-V L 4715-Myc tag. 

Figure 16a/b shows the nucleotide sequence of the 
Hindlll-EcoRI insert of pGOSA.E (see SEQ ID NO: 27) 
10 containing DNA encoding pelB leader-V H 4715-Linker-V L 3418 

and pelB leader-V L 3418 -Linker-V H 4715 . 

Figure 16c gives an overview of the oligonucleotides and 
their positions in pGOSA.E that can be used to replace V- 
15 domain gene fragments. 

Figure 17 shows the .construction of plasmid pGOSA.A. 

Figure 18 shows the construction of plasmid pGOSA.B. 

20 

Figure 19 shows the construction of plasmid pGOSA.C. 
Figure 20 shows the construction of plasmid pGOSA.D. 
25 Figure 21 shows the construction of plasmid pGOSA.E. 

Figure 22 shows the source of fragment PCR.I BstEII/SacI. 
Figure 23 shows the source of fragment PCR.IV XhoI/EcoRI. 

30 

Figure 24 shows the source of fragment PCR.V Sa.ll/EcoR.I. 
Figure 25 shows the source of fragment PCR.III Nhel/Sacl. 
35 Figure 26 shows the source of fragment PCR.II Sfil/EcoRI . 



Table 1 shows the nucleotide sequence of all 
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oligonucleotides used in the construction of the 
described double and triple head constructs . 

Table 2 lists all pSV expression constructs described 
5 in this specification. 

The following examples are provided by way of 
illustration only: 

10 EXAMPLES 

General Ex perimental 

Strains, Plasmids and Media 

15 All cloning steps were performed in E.Coli JM109 or 

E.Coli XL-1 Blue. Cultures were grown in 
2xTY/Amp/Glucose medium (16g tryptone, lOg yeast extract, 
5g NaCL per liter H 2 0 supplemented with 2% glucose and 
100^ig/ml ampicillin) . Transformations were plated out on 

20 SOBAG plates (2 0g tryptone, 5g yeast extract, I5g agar, 
0.5g NaCl per liter H 2 0 plus lOmM MgCl 2 , 2% glucose, 
100/ig/ml ampicillin) . The bicistronic E.coli vectors 
used are derivatives of pUC19 . The COS expression vector 
pSV.51 (LMBP strain nr 1829) was obtained from the LMBP 

25 Culture collection (Laboratory of Molecular Biology 

University Gent). COS-1 cells (ECACC No: 88031701; 
African green monkey kidney cells) were obtained from the 
European Collection of Animal Cell Cultures (ECACC) . All 
tissue culture reagents were from Gibco BRL (Life 

30 Technologies, Paisley, UK) 

DNA Manipulations 
Oligonucleotides and PCR 

The oligonucleotide primers used in the PCR reactions 
35 were synthesized on an Applied Biosys terns 381A DNA 

Synthesiser by the phosphoramidite method. The primary 
structures of the oligonucleotide primers used in the 
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construction of the trispecific pSV constructs (Table 2) 
are shown in Table 1. Reaction mixture used for 
amplification of DNA fragments were lOmM Tris-HCl, pH8.3, 
2.5mM MgCl 2 , 50mM KC1, 0.01% gelatin (w/v) , 0.1% Triton X- 
5 100, 400mM of each dNTP, 5.0 units of Vent DNA polymerase 

(New England Biolabs) , lOOng of template DNA, and 500ng 
of each primer (for 100/il reactions) . Reaction 
conditions were: 94°C for 4 minutes, followed by 33 
cycles of each 1 minute at 94°C, 1 minute at 55°C, and 1 
10 minute 72 °C. 

Plasmid DNA \Vec tor \ Insert preparation and 
ligation \ transformation . 

Plasmid DNA was prepared using the ' Qiagen P-100 and P- 

15 500 Midi/Maxi-DNA Preparation' system. Vectors and 

inserts were prepared by digestion of 10/xg (for vector 
preparation) or 20^g (for insert preparation) with the 
specified restriction endonucleases under appropriate 
conditions (buffers and temperatures as specified by 

20 suppliers) . Klenow fill-in reactions and 

dephosphorylation with Calf Intestine Phosphorylase were 
performed according to the manufacturers instructions. 
Vector DNA's and inserts were separated through agarose 
gel electrophoresis and purified with DEAE- membranes NA45 

25 (Schleicher & Schnell) as described by Maniatis et al . 

(Molecular cloning: a Laboratory manual, Cold Spring 
Harbour, N.Y. (1982)) Ligations were performed in 20jUl 
volumes containing 30mM Tris-HCl pH7 . 8 , lOmM MgCl 2 , lOmM 
DTT, ImM ATP, 300-400ng vector DNA, 100-200ng insert DNA 

30 and 1 Weiss unit T 4 DNA ligase. After ligation for 2-4 h 

at room temperature, CaCl 2 competent E. coli JM109 or XL-1 
Blue (Maniatis et al) were transformed using 7.5/zl 
ligation reaction. The transformation mixtures were 
plated onto SOB AG plates and grown overnight at 37°C. 

35 Correct clones were identified by restriction analysis 

and verified by automated dideoxy sequencing (Applied 
Biosystems) . 
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Restriction digestion of PCR products 

Following amplification each reaction was checked for the 
presence of a band of the appropriate size by agarose gel 
electrophoresis. One or two 100/il PCR reaction mixtures 
5 of each of the PCR reactions, together containing 

approximately 2-4/zg DNA product were subjected to phenol - 
chloroform extraction, chloroform extraction and ethanol 
precipitation. The DNA pellets were washed twice with 
70% ethanol and allowed to dry. Next, the PCR products 
10 were digested overnight (18 h) in 200^L lxBuffer with 

excess of the appropriate restriction enzyme. 

Transformation of COS Cells 

Cos-1 cells were maintained in DMEM culture medium with 

15 glutamine (2mM) , Penicillin (lOOU/mL) , streptomycin 

(100/xg/mL) containing 10% F.C.S. For transient 
transfection assays 1-3 xlO 5 COS-1 cells were seeded in 3 
cm-diameter tissue culture dishes (2mL) . The cells were 
incubated at 37°C in a C0 2 incubator until cells were 50- 

20 80% confluent (overnight) . For each transfection the 

following mixes were prepared: A) l/zg of each of the 
specified DNA 1 s in lOO^L Opti-MEM-I Reduced Serum Medium, 
B) ifih Lipof ectAmine in 100/iL Opti-MEM-I Reduced Serum 
Medium. Mixes A and B were combined (gently) . After 

25 allowing the DNA-liposome complexes to form for 30-45 

minutes at room temperature, 0 . 8mL Opti-MEM-I Reduced 
Serum Medium was added to each lipid DNA complex 
containing tube. The COS-l cells were washed once with 
2mL of Opti-MEM-I Reduced Serum Medium and overlayed with 

30 the diluted complex solution. The COS-1 cells were 

incubated for 5 hr at 37°C. Following incubation, 2mL 
growth medium was added. 20 hours following transfection 
the medium was replaced with 2mL fresh growth medium 
containing O.lmM Na-butyrate. After 48 hours incubation 

35 at 37 °C the supernatant was harvested and assayed for the 
presence of antibody fragments. 
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EL ISA 

A) GOSA: Glucose Oxidase and S. sanguis binding activity 

96 well EL ISA plates (Greiner HC plates) were activated 
5 overnight at 37°C with 200txl/well of a 1/10 dilution of 

an overnight culture of Streptococcus sanguis cells in 
0.05M sodium carbonate buffer pH9.5 was used to sensitise 
each well. Following one wash with PBST, the antigen 
sensitised plates were pre-blocked for 1 hour at 37°C 

10 with 200/xl/well blocking buffer (1% BSA, 0.15% Tween in 

PBS) . 50/xl COS culture supernatants (neat or diluted 
with PBS) plus 50/xl blocking buffer containing glucose 
oxidase (50/zg/ml) was added to the Streptococcus Sanguis 
sensitised plate and incubated for 2 hours at 37 °c. 

15 Following 4 washes with PBS-T, bound glucose oxidase was 

detected by adding 100/zl substrate to each well (70mM Na- 
citrate, 320mM Na-phosphate, 27mg/ml glucose, 0.5/xg/ml 
HRP, 100/xg/ml TMB) . The colour reaction was stopped 
after 1 hour by the addition of 35/xl 2M HC1 and the A450 

2 0 was measured. 

B) LYSOX: Lyaozyme and Glucose Oxidase binding activity 

96 well ELISA plates (Greiner HC plates) were activated 
overnight at 37°C with lysozyme (50/xg/mL in 0 . 05M sodium 
25 carbonate buffer pH9 . 5 ; 200/xl/well). Following one wash 

with PBST, the antigen sensitised plates were pre-blocked 
for 1 hour at 3 7°C with 200/xl/well blocking buffer (1% 
BSA, 0.15% Tween in PBS). 50/xl COS culture supernatants 
(neat or diluted with PBS) plus 50/tl blocking buffer 

3 0 containing glucose oxidase (50/xg/ml) was added to the 

Streptococcus Sanguis sensitised plate and incubated for 
2 hours at 37°C. Following 4 washes with PBS-T, bound 
glucose oxidase was detected by adding 100/xl substrate to 
each well (70mM Na-citrate, 320mM Na-phosphate, 27mg/ml 
35 glucose, 0.5/xg/ml HRP, 100/xg/ml TMB) . The colour 

reaction was stopped after 1 hour by the addition of 35/xl 
2M HC1 and the A450 was measured. 
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C) LYSAN: S. sanguis and Lysozyme binding activity 

96 well EL ISA plates (Greiner HC plates) were activated 
overnight at 37°C with 200/il/well of a 1/10 dilution of 
an overnight culture of Streptococcus sanguis cells in 
5 0.05M sodium carbonate buffer pH9 . 5 was used to sensitise 
each well . Following one wash with PBST, the antigen 
sensitised plates were pre-blocked for 1 hour at 37°C 
with 200/il/well blocking buffer (1% BSA, 0.15% Tween in 
PBS) . 50/il COS culture supernatants (neat or diluted 

10 with PBS) plus 50^1 blocking buffer was added to the 

Streptococcus Sanguis sensitised plate and incubated for 
2 hours at 37°C. Following 4 washes with PBS-T, 50^L 
blocking buffer containing Alkaline- Phosphatase 
conjugated Lysozyme (100/x/mL) . Unbound Lysozyme was 

15 removed by 4 washes with PBS-T. Bound Lysozyme was 

detected by adding 100/iL substrate solution to each well 
(lmg/tnl pNPP in 1M diethanolamine, ImM MgCl 2 ) . After 1 
hour the A4 05 was measured. 

20 EXAMPLE 1: Construct ion of the pSV.Golvsan expression 

vectors 

The construction of the pSV COS expression vectors 
consisted of three stages: 

25 

IA) : Assembly of 2 heavy chain variable domains and 2 

light chain variable domains in a pUC based E.Coli 
expression vector thus constructing the VH A -VH B and 
VL A -VL B modules respectively. 

30 

IB) : Assembly of 3 heavy chain variable domains and 3 

light chain variable domains in a pUC based E.Coli 
expression vector thus constructing the VH A -VH B -VH C 
and VL A -VL B -VL C modules respectively. 

35 

2) Linking the VH A -VH B( VH A -VH B -VH C and VL A -VL B , VL A -VL B - 
VL C to the genomic anti-NP leader sequence in the 
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intermediate "EUKA" vectors to ensure efficient 
secretion by COS cells. 

3) Inserting the leader- VH A -VH B , leader-VH A -VH B -VH c and 
5 leader-VL A -VL Bi leader-VL A -VL B -VL c as Xbal/Xbal 

fragments downstream of the SV4 0 promoter in the COS 
expression vector pSV.5l. 

ad.i; E.coli expression vectors. 

The E.coli expression vectors are derivatives of pUC.19 
containing a Hindlll-EcoRI fragment that in the case of 
the scFv.lys-myc contains a pelB signal sequence fused to 
the 5' end of the heavy chain V-domain that is directly 
linked to the corresponding light chain V-domain of the 
antibody through a connecting sequence that codes for a 
flexible peptide (Gly 4 Ser) 3 thus generating a single-chain 
molecule. In the "double head 1 expression vector both 
the heavy chain and the light chain V- domains of the 
antibody are preceded by a ribosome binding site and a 
pelB signal sequence in an artificial dicistronic operon 
under the control of a single inducible promoter. 
Expression of these constructs is driven by the inducible 
lacZ promoter. The nucleotide sequence of the Hindlll- 
EcoRI inserts of the scFv. lys-myc, scFv.4715.2t and 
pGOSA.E2t constructs used for the generation of the 
trispecific antibody fragments are listed in Figures 3 
and 2 respectively. 

ad.lAJ Assembly of bi-specific fragments or double heads. 

30 The construct pGOSA.E2t (Figures 2 and 6A) is derived 

from the E.coli expression construct pGOSA.E. The 
construction of pGOSA.E has been described in detail in 
preparation 1 below. 

3 5 In contrast with pGOSA.E, pGOSA.E2t contains a peptide 

tag at the C- terminus of the Variable light chain. Using 
oligonucleotides DBL3 and DBL.4 the VL4715 gene fragment 



15 



20 
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was amplified using scFv.47l5.2t as a template. The 
Sall/BamHI VH4715.2t PCR fragment and the Hydrophil-2 tag 
containing BamHI/EcoRI fragment from scFv.47l5.2t (Figure 
3B) were used to replace the Sall/EcoRI VH4715 fragment 
5 in pGOSA.E thus producing pGOSA.E2t. 

The vector pGOSA.E2t and the oligonucleotides in Table 1 
have been designed to enable most specificities to be 
cloned into the pGOSA.E2t construct (Figure 6A) . The 

10 upstream V H domain can be replaced by any Pstl-BstEII V H 

gene fragment obtained with oligonucleotides PCR. 51 and 
PCR. 89. The oligonucleotides DBL.l and DBL.2 were 
designed to introduce Sfil and Nhel restriction sites in 
the V H gene fragments thus allowing cloning of those V H 

15 gene fragments into the Sfil-Nhel sites as the downstream 

V H domain. Using this approach the following VH A -VH B 
combinations were constructed: VH4715-VH3418 , VH4715- 
VHlys, VH3418-VHlys, VHlys-VH3418 . 

20 All V L gene fragments obtained with oligonucleotides 

PCR. 116 and PCR. 90 can be cloned into the position of the 
3418 V L gene fragment as a Sacl-Xhol fragment. A 
complication here however is the presence of an internal 
Sad site in the 3418 V H gene fragment. Oligonucleotides 

25 DBL.3 and DBL.4 are designed to allow cloning of V L gene 

fragments into the position of the 4 715 V L gene fragment 
as a Sall-BamHl fragment. A complication here however is 
the presence of an internal BamHI site in the hydrophil- 
2 -tag gene fragment (2t) . Using this approach the 

3 0 following VL A -VL B combinations were constructed: VL3418- 

VL4715.2t, VLlys-VL4715.2t and VLlys-VL3418 . 2t . 

ad. IB) Assembly of tri-specific fragments or triple 
heads. 

35 Amplification of the VH-linker fragments using either 

scFv (VH-1 inker- VL) or bi-specific constructs (VH-linker- 
VH) as template with the primer combination DBL . l/DBL . 5 
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(Figure 7A) yields one of the building blocks for the 
construction of the VH A -VH B -VH C modules. The VH-linker 
DBL.1/DBL.5 PCR fragment is digested with Sfil and 
inserted into the Sfil site that is present between the 
5 linker sequence and the downstream VH domain in all bi- 
specific constructs (Figure 7B) thus producing a VH A -VH B - 
VH C module. Using this approach the following VH A -VH B -VH C 
combinations were constructed for this filing: VH4715- 
VHlys-VH3418 and VHlys-VH4715-VH3418 . 

10 

Using a bi-specific construct (VL-linker-VL) as the 
template in an amplification reaction with the primer 
combination DBL . 3/DBL . 6 (Figure 7A) yields the VL- linker 
building block for the construction of the VL A -VL B -VL C 

15 modules. The VL-linker DBL. 3/DBL. 6 PCR fragment is 

digested with Sail and inserted into the Sail site that 
is present between the linker sequence and the downstream 
VL domain in all bi-specific constructs (Figure 7B) thus 
producing a VL A -VL B -VL C module. Using this approach the 

20 following VL A -VL B -VL C combinations were constructed: 
VLlys-VL4715-VL3418.2t and VL3418-VLlys-VL4715 . 2t . 

A schematic representation of the final tri-specific 
constructs is shown in Figure 6B. 

25 

ad. 2) Linking the variable region domains to the leader 
sequence . 

The Hindlll/EcoRI polylinker of pUC19 was replaced with a 
synthetic EcoRI/Hindlll 'Euka' polylinker. This was 

30 achieved by annealing and inserting the synthetic 

oligonucleotides Euka.l and Euka. 2 (Table 1) into 
EcoRI/Hindlll digested pUC19 vector. The resulting 
Euka.pUC vector contains all restriction sites needed for 
the subcloning of the leader sequence and the VH and VL 

35 domains. The NcoI/PstI genomic anti-NP leader sequence 
fragment was cloned into the NcoI/PstI digested Euka.pUC 
vector yielding the Euka.VH construct (Figure 8A) . 
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Oligonucleotides ML.l and ML. 2 (Table 1) were used in an 
amplification reaction to introduce a SacI site at the 3' 
end of the leader sequence that allows the construction 
of leader-VL fusions. The NcoI/SacI leader sequence PCR 
5 fragment was inserted into NcoI/SacI digested Euka.pUC 

vector yielding the Euka.VL construct (Figure 8A) . 

The VH A -VH B and VH A -VH B -VH C modules were excised from the 
pUC expression vectors as Pstl/Nhel fragments and 

10 inserted into Pstl/Nhel digested Euka.VH vector (Figure 

8B) . Using this approach the following leader-VH A -VH B and 
leader-VH A -VH B -VH c combinations were constructed for this 
filing: Euka.B: leader-VH4715-VH3418 , Euka.D: leader- 
VH4715-VHlys, Euka.G: leader-VH3418 -VHlys , Euka.K: 

15 leader-VH4715-VHlys-VH3418 and Euka.M: leader-VHlys- 

VH4715-VH3418 . 

The VL A -VL B and VL A -VL B -VL C modules were excised from the 
pUC expression vectors as EcoRI-Klenow/SacI fragments and 

20 inserted into Not I -Klenow/SacI treated Euka.VL vector 

(Figure 8C) . Using this approach the following leader- 
VL a -VLb and leader- VL A -VL B -VL C combinations were 
constructed: Euka.N: leader-VL3418-VL4715 .2t, Euka.P: 
leader-VLlys-VL4715.2t Euka . S : leader-VLlys-VL34l8 . 2t , 

25 Euka.V: leader-VLlys-VL4715-VL3418 .2t and Euka.W: leader- 

VL3418-VLlys-VL4715 . 2t . 

ad. 3) Subcloning of leader-variable domain fusions into 
the pSV. 51 expression vector 

30 All leader-VH A -VH B , leader-VH A -VH B -VH c , leader-VL A -VL B and 

leader-VL A -VL B -VL c combinations were excised from the 
'Euka' vectors as Xbal/Xbal fragments and subcloned 
downstream of the SV40 promoter in pSV.51 (Figure 5) by 
insertion into the Xbal site (Figure 8B and 8C) . After 

35 confirmation of the correct orientation of the inserts 

the pSV expression vectors were used to transfect COS-l 
cells (see Example 2) . The pSV expression vectors used 
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are listed in Table 2. 

Example 2: Bifunctional binding activity of Golvsan 
triple heads 

5 

This example describes the production of three types of 
bispecific binding activity by COS-1 cells transfected 
with expression plasmids encoding the corresponding VH A - 
VH B -VH C and VL A -VL B -VL C genes fragments. 

10 

2. Production of antibody fragments by COS-1 cells 

Supernatants of COS-1 cells transfected with combinations 
of pSV-VH A -VH B -VH c and pSV-VL A -VL B -VL c expression plasmids 
were separated on 10% SDS-PAGE and transferred onto 

15 nitrocellulose. The resulting Western blots were 

screened with a monoclonal antibody recognising a peptide 
sequence in framework 4 of the VH domains (region encoded 
by PCR.89: conserved in all used VH domains, {in-house 
reagent}) and/or a monoclonal specific for the hydrophil- 

20 2 tag. As shown in Figure 9 all supernatants contained 

products with the expected molecular weight of the VH A - 
VH B -VH C and VL a -VLb-VL c fragments, indicating that the COS 
cells were successfully tranfected and were secreting the 
produced antibody fragments into the culture medium at 

25 detectable levels. 

2. Bifunctional binding activity 

Supernatants of COS-1 cells transfected with single pSV 
expression plasmids and combinations of pSV expression 
30 plasmids were tested for the production of bifunctional 

binding activity using ELISA format: 

* Supernatants of COS-1 cells transfected with the 
bispecific positive controls 1 LYSAN 1 (pSV.D + pSV.P), 
35 'LYSOX' (pSV.G + pSV.S) and 'GOSA' (pSV.B + pSV.N) only 

produced LYSAN, LYSOX and GOSA bispecific activity 
respectively (Figure 10) . No significant cross 
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reactivity was detected. 

* Supernatants of COS-1 cells transfected with only one 
expression vector encoding either one of the VH A -VH B -VH C 

5 fragments (pSV.K and pSV.M) or one of the VL A -VL B -VL C 

fragments (pSV.V and pSV.W) did not exhibit any 
bispecific binding activity, indicating that no 
background binding or a specific binding activity is 
produced . 

10 

* All tested supernatants of COS-1 cells transfected with 
an expression vector encoding one of the VH A -VH B -VH C 
fragments (pSV.K and pSV.M) and an expression vector 
encoding one of the VL A -VL B -VL C fragments (pSV.V and 

15 pSV.W) showed significant levels of all three 

bifunctional binding activities LYSOX, GOSA and LYSAN. 

These results show that COS cells transfected with 
expression vectors encoding VH A -VH E -VH C and expression 

20 vectors encoding VL A -VL B -VL C fragments produce and secrete 

molecules that contain three binding activities. In this 
example those three activities are: Glucose Oxidase 
binding, S. sanguis binding and Lysozyme binding. 
Furthermore, the results illustrated in Figure 10 clearly 

25 show that at least £wQ of these binding activity are 

present in one self assembling molecular complex. In 
this example those combinations are: GOSA (Glucose 
Oxidase + S. sanguis), LYSOX (Lysozyme + Glucose Oxidase) 
and LYSAN (Lysozyme + S. sanguis). 

30 

Example 3: Trif unction al binding activity of Golysan 
triple headp 

This example describes experiments that show that the 
35 three types of bispecific binding activity that are 

produced by COS-1 cells transfected with expression 
plasmids encoding the corresponding VH A -VH B -VH C and VL A - 
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VL B -VL C genes fragments are present in one self assembling 
molecular complex. 

Golysan.A (VHlys-VH4715-VH3418 + VLlys-VL4715 -VL3418 . 2t) 
5 and Golysan.B (VHlys-VH4715-VH3418 + VL3418-VLlys- 

VL47l5.2t) was purified by affinity chromatography. 
100ml supernatant of COS-1 cells transfected with 
expression plasmids pSV.M/pSV.V (Golysan.A) or 
pSV.M/pSV.W (Golysan.B) were loaded onto a Lysozyme- 

10 Sepharose column (CNBr-Sepharose, Pharmacia; column was 

prepared according to the manufacturer's instructions). 
After extensive washes with PBS the bound Golysan 
antibody fragments were eluted in 0 . 1M glycine buffer at 
pH=2.2. The fractions were neutralised with Tris and 

15 tested for the presence of trispecific binding activity. 

As shown in Figure 11 no bispecific binding activity was 
detect in the column fall- through . All three bispecific 
binding activities (GOSA, LYSOX and LYSAN) were extracted 

20 from the COS-1 supernatant by passing over the Lysozyme 

affinity matrix. After acid elution all three bispecific 
binding activities (GOSA, LYSOX and LYSAN) were recovered 
from the column. Since both Golysan.A and B were 
affinity purified based on the ability to bind to 

25 Lysozyme, the finding that these molecules also bind 

S. sanguis and Glucose Oxidase shows that all three 
binding activities are present in one self assembling 
molecular complex. 

3 0 Preparation 1. 

Construction of the pGOSA.E double head expression vector 

In the pGOSA expression vectors, the DNA fragments 
encoding both the V H and V t of the antibody are preceded 
35 by a ribosome binding site and a DNA sequence encoding 
the pelB signal sequence in an artificial dicistronic 
operon under the control of a single inducible promoter. 
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Expression of these constructs is driven by the inducible 
lacZ promoter. The nucleotide sequence of the ffindlll- 
EcoRI inserts of the plasmids pUR.4124 (SEQ ID NO. 23), 
Fv.3418 (SEQ ID NO. 24), Fv.4715-myc (SEQ ID NO. 25) and 
5 scFv.4715-myc (SEQ ID NO. 26) constructs used for the 

generation of the bispecific antibody fragments are given 
in Figures 12-15, respectively. Moreover, a culture of 
E. coli cells harbouring plasmid scFv.4715-myc and a 
culture of E. coli cells harbouring plasmid Fv.3418 were 
10 deposited under the Budapest Treaty at the National 

Collection of Type Cultures (Central Public Health 
Laboratory) in London (United Kingdom) with deposition 
numbers NCTC 12916 and NCTC 12 915, respectively. 

15 In agreement with Rule 28 (4) EPC, or a similar 

arrangement for a State not being a Contracting State of 
the EPC, it is hereby requested that a sample of such 
deposit, when requested, will be submitted to an expert 
only. 

20 

The construction of pGOSA.E (see Figure 16 for the 
Hindlll-EcoRI insert of pUC19) involved several cloning 
steps. The appropriate restriction sites in the various 
domains were introduced by PCR directed mutagenesis using 
25 the oligonucleotides listed in Table 1 below. 

The construction of pGOSA.E involved several cloning 
steps that produced 4 intermediate constructs pGOSA.A to 
pGOSA.D (see Figures 17-21) . The final expression vector 

30 pGOSA.E and the oligonucleotides in Table 1 have been 

designed to enable most specificities to be cloned into 
the final pGOSA.E construct (Figure 16c) . The upstream V H 
domain can be replaced by any Pstl-BstEII V H gene fragment 
obtained with oligonucleotides PCR. 51 and PCR. 89 (see 

35 Table 1) . The oligonucleotides DBL.l and DBL.2 (see 
Table 1) were designed to introduce Sfil and Nhel 
restriction sites in the V H gene fragments thus allowing 
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cloning of those V H gene fragments into the Sfil-Nhel 
sites as the downstream V H domain. All V L gene fragments 
obtained with oligonucleotides PCR.116 and PCR.90 (see 
Table 1) can be cloned into the position of the V L .3418 
5 gene fragment as a SacI-.YhoI fragment. A complication 

here however is the presence of an internal Sad site in 
the V H .3418 gene fragment. Oligonucleotides DBL.3 and 
DBL.9 (see Table 1) are designed to allow cloning of V L 
gene fragments into the position of the V L .4715 gene 
10 fragment as a Sall-i\fotl fragment. 

pGOSA.A 

This plasmid is derived from both the Fv.4715-myc 
construct (SEQ ID NO. 25) and the scFv. 4715-myc construct 

15 (SEQ IN NO. 26) . An Sfil restriction site was introduced 

between the DNA sequence encoding the (Gly 4 Ser) 3 linker 
and the gene fragment encoding the V L of the scFv. 4715-myc 
construct (see Figure 17) . This was achieved by 
replacing the BstEII-SacT fragment of the latter 

20 construct by the fragment PCR-I BstEII/SacI (Figure 22) 

that contains an Sfil site between the DNA encoding the 
(Gly 4 Ser) 3 linker and the V L .4715 gene fragment. The 
introduction of the Sfil site also introduced 4 
additional amino acids (AlaGlySerAla) between the 

25 (Gly 4 Ser) 3 linker and V L .4715 resulting in a 

(Gly<Ser) 3 AlaGlySerAla linker (linkerA) . The 
oligonucleotides used to produce PCR-I (DBL.5 and DBL.7, 
see Table 1) were designed to match the sequence of the 
framework- 3 region of V H .4715 and to prime at the junction 

30 of the DNA encoding the (Gly 4 Ser) 3 linker and the V L .4715 

gene fragment, respectively. Thus pGOSA.A can be 
indicated as : 

pel B-V a 47 15 -linkerA- (Sfil) -V L 4715-myc . 

35 pGOSA.B 

This plasmid is derived from plasmid Fv.3418 (see Figure 
18). The XhoI-EcoRI fragment of plasmid Fv.3418 
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comprising the 3' end of DNA encoding framework-4 of the 
V L including the stop codon was removed and replaced by 
the fragment PCR-IV XhoI/EcoRI (Figure 23) . The 
oligonucleotides used to produce PCR-IV (DBL.8 and DEL. 6, 
5 see Table 1) were designed to match the sequence at the 

junction of the V L and the (Gly 4 Ser) 3 linker perfectly 
(DBL.8) , and to be able to prime at the junction of the 
(Gly 4 Ser) 3 linker and the V H in pUR.4124 (DBL.6). DBL.6 
removed the PstI site in the V H (silent mutation) and 
10 introduced a Sail restriction site at the junction of the 

(Gly 4 Ser) 3 linker and the V H , thereby replacing the last 
Ser of the linker by a Val residue resulting in a 
(Gly 4 Ser) 2 Gly 4 Val linker (linkerV) . Thus pGOSA.B can be 
indicated as: 

15 pelB-V H 3418 + pelB-V L 3418-linkerV- (Sall-EcoRI) . 

pGOSA.C 

This plasmid contains DNA encoding V H .4715 linked by the 
(Gly 4 Ser) 3 AlaGlySerAla linker to V H .3418 (see Figure 19), 
2 0 thus: 

pelB-V K 4715-linkerA-V H 3418. 

This construct was obtained by replacing the Sfil-EcoRI 
fragment from pGOSA.A encoding V L .4715 by the fragment 

25 PCR-II Sfil/EcoRI containing the V H .3418 gene. The 

oligonucleotides used to produce PCR-II (DBL.l and DBL.2, 
see Table 1) hybridize in the framework- 1 and framework-4 
region of the gene encoding V H .3418, respectively. DBL.l 
was designed to remove the PstI restriction site (silent 

30 mutation) and to introduce an Sfil restriction site 

upstream of the V H gene. DBL.2 destroys the BstEII 
restriction site in the framework-4 region and introduces 
an Nhel restriction site downstream of the stopcodon. 

35 pGOSA.D 

This plasmid contains a dicistronic operon comprising the 
V H .3418 gene and DNA encoding V L .3418 linked by the 
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(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 20), thus: 
pelB-V H 3418 + pelB-V Ii 3418-linkerV-V L 4715 . 

This construct was obtained by digesting plasmid pGOSA.B 
5 with Sall-EcoRI and inserting the fragment PCR-V 

Sall/EcdRI (Figure 24) containing the V L .4715 gene. The 
oligonucleotides used to obtain PCR-V (DBL.3 and DBL.9, 
see Table 1) were designed to match the nucleotide 
sequence of the framework- 1 and framework- 4 regions of 

10 the V L .4715 gene, respectively. DBL.3 removed the SacI 

site from the framework- 1 region (silent mutation) and 
introduced a Sail restriction site upstream of the V L .4715 
gene. DBL.9 destroyed the Xhol restriction site in the 
framework-4 region of the V L .4715 gene (silent mutation) 

15 and introduced a NotI and an jEcoRI restriction site 

downstream of the stop codon. 

pGOSA.E 

This plasmid contains a dicistronic operon comprising DNA 
20 encoding V H .4715 linked by the (Gly 4 Ser) 3 AlaGlySerAla 

linker to V K .3418 plus DNA encoding V L .3418 linked by the 
(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 21), thus: 
pel B-V H 4715-linkerA-V H 3418 + pelB-V L 3418-linkerV-V L 4715 . 

25 Both translational units are preceded by a ribosome 

binding site and DNA encoding a pelB leader sequence. 
This plasmid was obtained by a three-point ligation by 
mixing the vector resulting from pGOSA.D after removal of 
the V H 3418 -encoding Pstl-SacI insert with the Pstl-Miel 

30 pGOSA.C insert containing V H .4715 linked to V H .3418 and 

the PCR-III Nhel/SacI fragment (see Figure 25) . The 
remaining Pstl-SacI pGOSA.D vector contains the 5' end of 
the framework- 1 region of V H .3418 upto the PstI 
restriction site and V L .3418 linked by the 

35 (Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 starting from the Sad 

restriction site in V L .3418. The Pstl-Miel pGOSA.C insert 
contains V H .4 715 linked by the (Gly 4 Ser) 3 AlaGlySerAla 



WO 97/38102 



PCT/EP97/01609 



- 30 - 

linker to V H .3418, starting from the PstI restriction site 
in the framework-1 region in V H .4715. The Nhel-SacI PCR- 
III fragment provides the ribosome binding site and DNA 
encoding the pelB leader sequence for the V L .3418- 
5 (Gly 4 Ser) 2 Gly 4 Val-V L .4715 construct. The oligonucleotides 

DBL.10 and PCR.116 (see Table 1) used to generate PCR-III 
were designed to match the sequence upstream of the 
ribosome binding site of V L .4715 in Pv.4715 and to 
introduce an Nhel restriction site (DBL.10), and to match 
10 the framework-4 region of V L .3418 (PCR.116) . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT : 

(A) NAME: UNILEVER PLC 

(B) STREET: Blackfriars 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : EC4P 4BQ 

(G) TELEPHONE : (01234) 222644 

(H) TELEFAX: (01234) 222633 

(I) TELEX: 8222 9 UNILAB G 



(ii) TITLE OF INVENTION: Multivalent and multispecif ic antigen-binding 
protein 



(iii) NUMBER OF SEQUENCES: 2 7 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: 

Patentln Release #(1.0, Version #1.25 (EPO) (SEC ID NO. 1 to 18) 
Patentln Release #1.0, Version #1.30 (EPO) (SEC ID NO. 19 to 27) 



(2) INFORMATION FOR SEQ ID NO : 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



AAGCTTGCAT GGAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 6 0 



GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCAGCGA TGGCCCAGGT GCAGCTGCAG 12 0 
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GAGTCAGGGG GAGACTTAGT GAAGCCTGGA GGGTCCCTGA CACTCTCCTG TGCAACCTCT 18 0 

GGATTCACTT TCAGTAGTTA TGCCTTTTCT TGGGTCCGCC AGACCTCAGA CAAGAGTCTG 24 0 

GAGTGGGTCG CAACCATCAG TAGTACTGAT ACTTATACCT ATTATTCAGA CAATGTGAAG 3 00 

GGGCGCTTCA CCATCTCCAG AGACAATGGC AAGAACACCC TGTACCTGCA AATGAGCAGT 36 0 

CTGAAGTCTG AGGACACAGC CGTGTATTAC TGTGCAAGAC ATGGGTACTA TGGTAAAGGC 42 0 

TATTTTGACT ACTGGGGCCA AGGGACCACG GTCACCGTCT CCTCAGGTGG AGGCGGTTCA 480 

GGCGGAGGTG GCTCTGGCGG TGGCGGATCG GCCGGTTCGG CCCAGGTCCA GCTGCAACAG 54 0 

TCAGGACCTG AGCTGGTAAA GCCTGGGGCT TCAGTGAAGA TGTCCTGCAA GGCTTCTGGA 6 00 

TACACATTCA CTAGCTATGT TATGCACTGG GTGAAACAGA AGCCTGGGCA GGGCCTTGAG 660 

TGGATTGGAT ATATTTATCC TTACAATGAT GGTACTAAGT ACAATGAGAA GTTCAAAGGC 72 0 

AAGGCCACAC TGACTTCAGA CAAATCCTCC AGCACAGCCT ACATGGAGCT CAGCAGCCTG 780 

ACCTCTGAGG ACTCTGCGGT CTATTACTGT TCAAGACGCT TTGACTACTG GGGCCAAGGG 84 0 

ACCACCGTCA CCGTCTCCTC ATAATAAGCT AGCGGAGCTG CATGCAAATT CTATTTCAAG 9 00 

GAGACAGTCA TAATGAAATA CCTATTGCCT ACGGCAGCCG CTGGATTGTT ATTACTCGCT 960 

GCCCAACCAG CGATGGCCGA CATCGAGCTC ACCCAGTCTC CATCTTCCAT GTATGCATCT 102 0 

CTAGGAGAGA GAATCACTAT CACTTGCAAG GCGAGTCAGG ACATTAATAC CTATTTAACC 108 0 

TGGTTCCAGC AGAAACCAGG GAAATCTCCC AAGACCCTGA TCTATCGTGC AAACAGATTG 114 0 

CTAGATGGGG TCCCATCAAG GTTCAGTGGC AGTGGATCTG GGCAAGATTA TTCTCTCACC 12 0 0 

ATCAGCAGCC TGGACTATGA AGATATGGGA ATTTATTATT GTCTACAATA TGATGAGTTG 12 6 0 

TACACGTTCG GAGGGGGGAC CAAGCTCGAG ATCAAACGGG GTGGAGGCGG TTCAGGCGGA 13 2 0 

GGTGGCTCTG GCGGTGGCGG AGTCGACATC GAACTCACTC AGTCTCCATT CTCCCTGACT 13 8 0 

GTGACAGCAG GAGAGAAGGT CACTATGAAT TGCAAGTCCG GTCAGAGTCT GTTAAACAGT 144 0 
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GTAAATCAGA GGAACTACTT GACCTGGTAC CAGCAGAAGC CAGGGCAGCC TCCTAAACTG 1500 

TTGATCTACT GGGCATCCAC TAGGGAATCT GGAGTCCCTG ATCGCTTCAC AGCCAGTGGA 156 0 

TCTGGAACAG ATTTCACTCT CACCATCAGC AGTGTGCAGG CTGAAGACCT GGCAGTTTAT 162 0 

TACTGTCAGA ATGATTATAC TTATCCGTTC ACGTTCGGAG GGGGGACCAA GCTCGAAATC 168 0 

AAACGGGGAT CCGGTAGCGG GAACTCCGGT AAGGGGTACC TGAAGTAATA AGCGGCCGCG 174 0 

AATTC 1745 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 6 0 

GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCAGCGA TGGCCCAGGT GCAGCTGCAG 12 0 

GAGTCAGGAC CTGGCCTGGT GGCGCCCTCA CAGAGCCTGT CCATCACATG CACCGTCTCA 18 0 

GGGTTCTCAT TAACCGGCTA TGGTGTAAAC TGGGTTCGCC AGCCTCCAGG AAAGGGTCTG 24 0 

GAGTGGCTGG GAATGATTTG GGGTGATGGA AACACAGACT AT AATTC AG C TCTCAAATCC 3 00 

AGACTGAGCA TCAGCAAGGA CAACTCCAAG AGCCAAGTTT TCTTAAAAAT GAACAGTCTG 36 0 

CACACTGATG ACACAGCCAG GTACTACTGT GCCAGAGAGA GAGATTATAG GCTTGACTAC 42 0 

TGGGGCGAAG GCACCACGGT CACCGTCTCC TCAGGTGGAG GCGGTTCAGG CGGAGGTGGC 48 0 

TCTGGCGGTG GCGGATCGGA CATCGAGCTC ACCCAGTCTC CAGCCTCCCT TTCTGCGTCT 54 0 

GTGGGAGAAA CTGTCACCAT CACATGTCGA GCAAGTGGGA ATATTCACAA TTATTTAGCA 600 
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TGGTATCAGC AGAAACAGGG AAAATCTCCT CAGCTCCTGG TCTATTATAC AACAACCTTA 66 0 

GCAGATGGTG TGCCATCAAG GTTCAGTGGC AGTGGATCAG GAACACAATA TTCTCTCAAG 72 0 

ATCAACAGCC TGCAACCTGA AGATTTTGGG AGTTATTACT GTCAACATTT TTGGAGTACT 78 0 

CCTCGGACGT TCGGTGGAGG CACCAAGCTC GAGATCAAAC GGGAACAAAA ACTCATCTCA 84 0 

GAAGAGGATC TGAATTAATA AGATCAAACG GTAATAAGGA TCCAGCTCGA ATTC 8 94 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 6 0 

GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCGGCCA TGGCCCAGGT GCAGCTGCAG 12 0 

GAGTCAGGGG GAGACTTAGT GAAGCCTGGA GGGTCCCTGA CACTCTCCTG TGCAACCTCT 18 0 

GGATTCACTT TCAGTAGTTA TGCCTTTTCT TGGGTCCGCC AGACCTCAGA CAAGAGTCTG 24 0 

GAGTGGGTCG CAACCATCAG TAGTACTGAT ACTTATACCT ATTATTCAGA CAATGTGAAG 3 00 

GGGCGCTTCA CCATCTCCAG AGACAATGGC AAGAACACCC TGTACCTGCA AATGAGCAGT 36 0 

CTGAAGTCTG AGGACACAGC CGTGTATTAC TGTGCAAGAC ATGGGTACTA TGGTAAAGGC 42 0 

TATTTTGACT ACTGGGGCCA AGGGACCACG GTCACCGTCT CCTCAGGTGG AGGCGGTTCA 48 0 

GGCGGAGGTG GCTCTGGCGG TGGCGGATCG GACATCGAGC TCACTCAGTC TCCATTCTCC 54 0 

CTGACTGTGA CAGCAGGAGA GAAGGTCACT ATGAATTGCA AGTCCGGTCA GAGTCTGTTA 6 00 

AACAGTGTAA ATCAGAGGAA CTACTTGACC TGGTACCAGC AGAAGCCAGG GCAGCCTCCT 66 0 
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AAACTGTTGA TCTACTGGGC ATCCACTAGG GAATCTGGAG TCCCTGATCG CTTCACAGCC 72 0 

AGTGGATCTG GAACAGATTT CACTCTCACC ATCAGCAGTG TGCAGGCTGA AGACCTGGCA 78 0 

GTTTATTACT GTCAGAATGA TTATACTTAT CCGTTCACGT TCGGAGGGGG GACCAAGCTC 84 0 

GAGATCAAAC GGGGATCCGG TAGCGGGAAC TCCGGTAAGG GGTACCTGAA GTAATAAGAT 9 00 

CAAACGGTAA TAAGGATCCA GCTCGAATTC 93 0 
(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

CCATGGGATG GAGCTGTATC ATCCTCTTCT TGGTAGCAAC AGCTACAGGT AAGGGGCTCA 6 0 

CAGTAGCAGG CTTGAGGTCT GGACATATAT ATGGGTGACA ATGACATCCA CTTTGCCTTT 12 0 

CTCTCCACAG GTGTCCACTC CCAGGTCCAA CTGCAG 156 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
AGGTSMAMCT GCAGSAGTCW GG 2 2 

(2) INFORMATION FOR SEQ ID NO : 6: 



WO 97/38102 



- 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TGAGGAGACG GTGACCGTGG TCCCTTGGCC CC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GACATTGAGC TCACCCAGTC TCCA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

GTTAGATCTC GAGCTTGGTC CC 

(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAGGATCCGG CCGGTTCGGC CCAGGTCCAG CTGCAACAGT CAGGA 4 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTACATGAAT TCGCTAGCTT ATTATGAGGA GACGGTGACG GTGGTCCCTT GGC 53 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATTGGAGTCG ACATCGAACT CACTCAGTCT CCATTCTCC 3 9 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGAATTCGGA TCCCCGTTTG ATTTCGAGCT TGGTCC 3 6 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 5 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAGCGCGAGC TCGGCCGAAC CGGCCGATCC GCCACCGCCA GAGCC 4 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATTGTCGAAT TCGTCGACTC CGCCACCGCC AGAGCC 3 6 

(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AGCTT CTAGA CCACCATGGA AAACTGCAGA GCTCAAAAGC TAGCGCGGCG GCTCTAG 57 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AATTCTAGAG CGGCCGCGCT AGCTTTTGAG CTCTGCAGTT TTCCATGGTG GTCTAGA 5 7 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ACGGGTGAGC TCGATGTCGG AGTGGACACC TGTGGAGAGA 4 0 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGAAACAGCT ATGACCATGA TTAC 2 4 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL.7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CACCATCTCC AGAGACAATG GCAAG 2 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL . 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ACCAAGCTCG AGATCAAACG GGG 2 3 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 



WO 97/38102 



PCT/EP97/01609 



- 41 - 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: primer DBL . 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGAAGTGAAT TCGCGGCCGC TTATTACCGT TTGATTT CGA GCTTGGTCCC 50 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: primer DBL. 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TAATAAGCTA GCGGAGCTGC ATGCAAATTC TATTTC 36 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " cDNA domains with synthetic linker(s)" 



(vii) IMMEDIATE SOURCE: 
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(B) CLONE: EcoRI-Hindlll insert of pUR4124 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 11. .730 

(D) OTHER INFORMATION :/product= " VLlys-GS- VHlys" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 11 . .334 

(D) OTHER INFORMATION : /product = "VLlys" 

(ix) FEATURE: 

(A) NAME /KEY : miBC_RNA 

(B) LOCATION:335 . .379 

(D) OTHER INFORMATION : /product= " (Gly4Ser) 3 linker" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

( B) LOCATION: 38 0. .727 

(D) OTHER INFORMATION: /product- "VHlys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GAATTCGGCC GAC ATC GAG CTC ACC CAG TCT CCA GCC TCC CTT TCT GCG 4 9 

Asp He Glu Leu Thr Gin Ser Pro Ala Ser Leu Ser Ala 
15 10 

TCT GTG GGA GAA ACT GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT 9 7 

Ser Val Gly Glu Thr Val Thr He Thr Cys Arg Ala Ser Gly Asn He 
15 20 25 

CAC AAT TAT TTA GCA TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG 14 5 

His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lye Ser Pro Gin 
30 35 40 45 

CTC CTG GTC TAT TAT ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG 193 
Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg 
50 55 60 

TTC AGT GGC AGT GGA TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC 241 
Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser 
65 70 75 
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CTG CAA CCT GAA GAT TTT GGG AGT TAT 
Leu Gin Pro Glu Asp Phe Gly Ser Tyr 
80 85 

ACT CCT CGG ACG TTC GGT GGA GGG ACC 
Thr Pro Arg Thr Phe Gly Gly Gly Thr 
95 100 

GGA GGC GGT TCA GGC GGA GGT GGC TCT 
Gly Gly Gly Ser Gly Gly Gly Gly Ser 
110 115 

CAG CTG CAG GAG TCA GGA CCT GGC CTG 
Gin Leu Glr. Glu Ser Gly Pro Gly Leu 
130 

TCC ATC ACA TGC ACC GTC TCA GGG TTC 
Ser He Thr Cys Thr Val Ser Gly Phe 
145 150 
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TAC TGT CAA CAT TTT TGG AGT 289 
Tyr Cys Gin His Phe Trp Ser 
90 

AAG CTC GAG ATC AAA CGG GGT 33 7 

Lys Leu Glu He Lys Arg Gly 
105 

GGC GGT GGC GGA TCG CAG GTG 385 
Gly Gly Gly Gly Ser Gin Val 
120 125 

GTG GCG CCC TCA CAG AGC CTG 43 3 

Val Ala Pro Ser Gin Ser Leu 
135 140 

TCA TTA ACC GGC TAT GGT GTA 4 81 

Ser Leu Thr Gly Tyr Gly Val 
155 



AAC TGG GTT CGC CAG CCT CCA GGA AAG GGT CTG GAG TGG CTG GGA ATG 
Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met 
160 165 170 

ATT TGG GGT GAT GGA AAC ACA GAC TAT AAT TCA GCT CTC AAA TCC AGA 
He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser Arg 
175 180 185 

CTG AGC ATC AGC AAG GAC AAC TCC AAG AGC CAA GTT TTC TTA AAA ATG 
Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lys Met 
190 195 200 205 

AAC AGT CTG CAC ACT GAT GAC ACA GCC AGG TAC TAC TGT GCC AGA GAG 
Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg Glu 
210 215 220 

AGA GAT TAT AGG CTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC 
Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val 
225 230 235 

TCC TCA TGA T AAG CTT 
Ser Ser * 
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240 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " cDNA domains with synthetic linker(e)" 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: Hindlll-EcoRI insert Fv.3418 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION:36 . .443 

( D) OTHER INFORMATION : /product= "pelB- VH3418 " 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION : 36 .. 101 

(D) OTHER INFORMATION: /product = "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION:102 . .440 

(D) OTHER INFORMATION: /product= "VH3418" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION:495. .884 

(D) OTHER INFORMATION : /product= "pelB- VL43 18 " 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION:495 . . 560 

(D) OTHER INFORMATION: /product= "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 561. .881 
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(D) OTHER INFORMATION: /product = "VL3418" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAGCTTGCAA ATTCTATTTC AAGGAGACAG TCATA ATG AAA TAC CTA TTG CCT 53 

Met Lys Tyr Leu Leu Pro 
-22 -20 

ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC 101 
Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala 
-15 -10 -5 

CAG GTG CAG CTG CAG CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT 14 9 

Gin Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala 
IS 10 15 

TCA GTG AAG ATG TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT 197 
Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 30 

GTT ATG CAC TGG GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT 24 5 

Val Met His Trp Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp lie 
35 40 45 

GGA TAT ATT TAT CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC 2 93 

Gly Tyr lie Tyr Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe 

50 55 60 

AAA GGC AAG GCC ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC 341 
Lys Gly Lys Ala Thr Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

ATG GAG CTC AGC AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT 38 9 

Met Glu Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 
85 90 95 

TCA AGA CGC TTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC TCC 43 7 

Ser Arg Arg Phe Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser 
100 105 110 

TCA TAA TAAGAGCTAT GGGAGCTTGC ATGCAAATTC TATTTCAAGG AGACAGTCAT 4 93 

Ser * 

A ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC 53 9 
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Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu 



GCT GCC CAA CCA GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT 
Ala Ala Gin Pro Ala Met Ala Aep He Glu Leu Thr Gin Ser Pro Ser 



TCC ATG TAT GCA TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG 
Ser Met Tyr Ala Ser Leu Gly Glu Arg He Thr He Thr Cys Lys Ala 



AGT CAG GAC ATT AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG 
Ser Gin Asp He Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Gly 



AAA TCT CCC AAG ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG 
Lys Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly 



GTC CCA TCA AGG TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC 
Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu 



ACC ATC AGC AGC CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA 
Thr He Ser Ser Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu 



CAA TAT GAT GAG TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC 
Gin Tyr Asp Glu Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He 
90 95 100 105 

AAA CGG TAA TAATGATCAA ACGGTATAAG GATCCAGCTC GAATTC 
Lys Arg * 

(2) INFORMATION FOR SEQ ID NO: 2 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker(s)" 
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i) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll -EcoRI insert of Fv.4715-myc 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION :4 0 . .468 

(D) OTHER INFORMATION :/product= "pelB-VH4715" 



(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION :4 0 . . 105 

(D) OTHER INFORMATION: /product = "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION:106 . .465 

(D) OTHER INFORMATION: /product = "VH4715" 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION : 520 .. 963 

(D) OTHER INFORMATION : /products "pelB-VL4715-myc" 

(ix) FEATURE: 

(A) NAME /KEY : eig_peptide 

(B) LOCATION:520 . .585 

(D) OTHER INFORMATION: /product = "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 586 . . 927 

(D) OTHER INFORMATION: /product = "VL4715" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 928 .. 96 0 

(D) OTHER INFORMATION : /product = "myc-tag" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 

Met Lys Tyr Leu Leu 
-22 -20 
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CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 

Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
-15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 

Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lys Pro Gly 



GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 



TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 



GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 
Val Ala Thr He Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Aen 



GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 
Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lys Asn Thr Leu 



TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 



TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA TAA TAAGAGCTAT GGGAGCTTGC 
Gin Gly Thr Thr Val Thr Val Ser Ser * 
115 120 

ATGCAAATTC TATTTCAAGG AGACAGTCAT A ATG AAA TAC CTA TTG CCT ACG 
Met Lys Tyr Leu Leu Pro Thr 



GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC GAC 5 88 

Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala Asp 
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ATC GAG CTC ACT CAG TCT CCA TTC TCC CTG ACT GTG ACA GCA GGA GAG 
He Glu Leu Thr Gin Ser Pro Phe Ser Leu Thr Val Thr Ala Gly Glu 



AAG GTC ACT ATG AAT TGC AAG TCC GGT CAG AGT CTG TTA AAC AGT GTA 684 
Lys Val Thr Met Asn Cye Lys Ser Gly Gin Ser Leu Leu Asn Ser Val 
20 25 30 



AAT CAG AGG AAC TAC TTG ACC TGG TAC CAG CAG AAG CCA GGG CAG CCT 732 
Asn Gin Arg Asn Tyr Leu Thr Trp Tyr Gin Gin Lys Pro Gly Gin Pro 
35 40 45 



CCT AAA CTG TTG ATC TAC TGG GCA TCC 
Pro Lys Leu Leu He Tyr Trp Ala Ser 
50 55 

GAT CGC TTC ACA GCC AGT GGA TCT GGA 
Asp Arg Phe Thr Ala Ser Gly Ser Gly 
70 

AGC AGT GTG CAG GCT GAA GAC CTG GCA 
Ser Ser Val Gin Ala Glu Asp Leu Ala 
85 90 



ACT AGG GAA TCT GGA GTC CCT 78 0 

Thr Arg Glu Ser Gly Val Pro 
60 65 

ACA GAT TTC ACT CTC ACC ATC 82 8 

Thr Asp Phe Thr Leu Thr He 
75 80 

GTT TAT TAC TGT CAG AAT GAT 87 6 

Val Tyr Tyr Cys Gin Asn Asp 

95 



TAT ACT TAT CCG TTC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA 
Tyr Thr Tyr Pro Phe Thr Phe Gly Gly Gly Thr LyB Leu Glu He Lys 
100 105 110 



CGG GAA CAA AAA CTC ATC TCA GAA GAG GAT CTG AAT TAA TAAGATCAAA 973 
Arg Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn * 
115 120 125 



CGGTAATAAG GATCCAGCTC GAATTC 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " cDNA domains with synthetic linker(s)" 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-EcoRI insert of scFv . 4 71 5 -myc 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION:40. .105 

(D) OTHER INFORMATION :/product= "pectate lyase" 

(ix) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 106 . .465 

(D) OTHER INFORMATION :/product= "VH4715" 

(ix) FEATURE: 

(A) NAME /KEY : mi sc_RNA 

(B) LOCATION:466 . . 510 

(D) OTHER INFORMATION : /product = "( Gly4Ser) 3 - linker " 

(ix) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION : 511 . .852 

(D) OTHER INFORMATION: /product = "VL4715" 

(ix) FEATURE: 

(A) NAME /KEY : misc_RNA 

(B) LOCATION : 8 53 . . 885 

(D) OTHER INFORMATION : /product = "myc-tag" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION:40 . .888 

(D) OTHER INFORMATION: /product = 



"pelB-VH4715- (Gly4Ser) 3-VL4715-myc" 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 

Met Lys Tyr Leu Leu 
-22 -20 



54 



CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
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GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lys Pro Gly 



GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 



TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 



GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 
Val Ala Thr lie Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 



GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 
Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Gly Lys Asn Thr Leu 



TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 no 

CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 

GGT GGC TCT GGC GGT GGC GGA TCG GAC ATC GAG CTC ACT CAG TCT CCA 
Gly Gly Ser Gly Gly Gly Gly Ser Asp lie Glu Leu Thr Gin Ser Pro 
130 135 140 

TTC TCC CTG ACT GTG ACA GCA GGA GAG AAG GTC ACT ATG AAT TGC AAG 
Phe Ser Leu Thr Val Thr Ala Gly Glu Lys Val Thr Met Asn Cys Lys 
145 150 155 



TCC GGT CAG AGT CTG TTA AAC AGT GTA AAT CAG AGG AAC TAC TTG ACC 
Ser Gly Gin Ser Leu Leu Asn Ser Val Asn Gin Arg Asn Tyr Leu Thr 
1G0 165 170 175 



630 
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TGG TAC CAG CAG AAG CCA GGG CAG CCT CCT AAA CTG TTG ATC TAC TGG 
Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lye Leu Leu lie Tyr Trp 
180 185 190 



GCA TCC ACT AGG GAA TCT GGA GTC CCT GAT CGC TTC ACA GCC AGT GGA 72 6 

Ala Ser Thr Arg Glu Ser Gly Val Pro Asp Arg Phe Thr Ala Ser Gly 
195 200 205 



TCT GGA ACA GAT TTC ACT CTC ACC ATC AGC AGT GTG CAG GCT GAA GAC 774 
Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Val Gin Ala Glu Asp 
210 215 220 



CTG GCA GTT TAT TAC TGT CAG AAT GAT TAT ACT TAT CCG TTC ACG TTC 822 
Leu Ala Val Tyr Tyr Cys Gin Asn Asp Tyr Thr Tyr Pro Phe Thr Phe 
225 230 235 



GGA GGG GGG ACC AAG CTC GAG ATC AAA CGG GAA CAA AAA CTC ATC TCA 87 0 

Gly Gly Gly Thr Lys Leu Glu lie Lye Arg Glu Gin Lys Leu He Ser 
240 245 250 255 



GAA GAG GAT CTG AAT TAA TAAGATCAAA CGGTAATAAG GATCCAGCTC GAATTC 924 
Glu Glu Asp Leu Asn * 
260 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " cDNA domains with synthetic linker(s)" 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: HindHI-EcoRI insert of pGOSA . E 



(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION:40. .864 

(D) OTHER INFORMATION :/product= "pelB- VH4 715 -LiA- VH341 8 " 



(ix) FEATURE : 
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(A) NAME/KEY: eig_peptide 

(B) LOCATIONS . . 105 

(D) OTHER INFORMATION :/product= "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_pept ide 

(B) LOCATION : 106 . .4 65 

(D) OTHER INFORMATION: /product- "VH4715" 

(ix) FEATURE : 

(A) NAME /KEY : misc_RNA 

(B) LOCATION: 46 6. .522 

(D) OTHER INFORMATION: /product = "linkerA 
(Gly4Ser) 3AlaGlySerAla " 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION : 523 .. 861 

(D) OTHER INFORMATION : /product= "VH3418" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 913 .. 1689 

(D) OTHER INFORMATION: /product = "pelB- VL34 18 -Li V-VL4 715 " 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 913 .. 978 

(D) OTHER INFORMATION: /product = "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION:979. .1299 

(D) OTHER INFORMATION :/product= "VL3418" 

(ix) FEATURE: 

(A) NAME /KEY : misc_RNA 

(B) LOCATION : 1300 .. 1344 

(D) OTHER INFORMATION: /product= "linker V 
<Gly4Ser)2Gly4Val" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1345 .. 16 8 6 
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(D) OTHER INFORMATION : /products "VL4715" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AAGCTTGCAT GGAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 

Met Lye Tyr Leu Leu 



CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 

Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 

-15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 

Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lye Pro Gly 



GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 



TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Aep Lye Ser Leu Glu Trp 



GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 
Val Ala Thr He Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 



GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 
Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lys Asn Thr Leu 



TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 



TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 
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GGT GGC TCT GGC GGT GGC GGA TCG GCC GGT TCG GCC CAG GTC CAG CTG 
Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Ser Ala Gin Val Gin Leu 
130 135 140 

CAA CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT TCA GTG AAG ATG 
Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala Ser Val Lys Met 
145 150 155 

TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT GTT ATG CAC TGG 
Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Val Met His Trp 
ISO 165 170 175 

GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT GGA TAT ATT TAT 
Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp lie Gly Tyr He Tyr 
180 185 190 

CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC AAA GGC AAG GCC 
Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala 
195 200 205 

ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC ATG GAG CTC AGC 
Thr Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr Met Glu Leu Ser 
210 215 220 

AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT TCA AGA CGC TTT 
Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ser Arg Arg Phe 
225 230 235 

GAC TAC TGG GGC CAA GGG ACC ACC GTC ACC GTC TCC TCA TAA 
Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser * 
240 - 245 250 

TAAGCTAGCG GAGCTGCATG CAAATTCTAT TTCAAGGAGA CAGTCATA ATG AAA TAC 

Met Lys Tyr 

-22 -20 

CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA 
Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro 



GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT TCC ATG TAT GCA 
Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ser Ser Met Tyr Ala 
15 10 



1017 
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TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG AGT CAG GAC ATT 
Ser Leu Gly Glu Arg He Thr He Thr Cys Lys Ala Ser Gin Asp He 



AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG AAA TCT CCC AAG 1113 
Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Gly Lys Ser Pro Lys 
30 35 40 45 



ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG GTC CCA TCA AGG 1161 
Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly Val Pro Ser Arg 

50 55 60 



TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC ACC ATC AGC AGC 12 0 9 

Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu Thr He Ser Ser 
65 70 75 



CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA CAA TAT GAT GAG 125 7 

Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu Gin Tyr Asp Glu 
80 85 90 



TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA CGG GGT GGA 13 0 5 

Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Gly Gly 
95 100 105 



GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA GTC GAC ATC GAA 1353 
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Val Asp He Glu 
110 115 120 125 



CTC ACT CAG TCT CCA TTC TCC CTG ACT GTG ACA GCA GGA GAG AAG GTC 14 01 

Leu Thr Gin Ser Pro Phe Ser Leu Thr Val Thr Ala Gly Glu Lys Val 
130 135 140 



ACT ATG AAT TGC AAG TCC GGT CAG AGT CTG TTA AAC AGT GTA AAT CAG 144 9 

Thr Met Asn Cys Lys Ser Gly Gin Ser Leu Leu Asn Ser Val Asn Gin 
145 150 155 

AGG AAC TAC TTG ACC TGG TAC CAG CAG AAG CCA GGG CAG CCT CCT AAA 14 9 7 

Arg Asn Tyr Leu Thr Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys 

160 165 170 

CTG TTG ATC TAC TGG GCA TCC ACT AGG GAA TCT GGA GTC CCT GAT CGC 154 5 

Leu Leu He Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro Asp Arg 
175 180 165 



WO 97/38102 



PCT7EP97/01609 



TTC ACA GCC AGT GGA TCT GGA ACA GAT 
Phe Thr Ala Ser Gly Ser Gly Thr Asp 
190 195 

GTG CAG GCT GAA GAC CTG GCA GTT TAT 
Val Gin Ala Glu Asp Leu Ala Val Tyr 
210 

TAT CCG TTC ACG TTC GGA GGG GGG ACC 
Tyr Pro Phe Thr Phe Gly Gly Gly Thr 
225 230 



- 57 - 

TTC ACT CTC ACC ATC AGC AGT 15 93 

Phe Thr Leu Thr He Ser Ser 
200 205 

TAC TGT CAG AAT GAT TAT ACT 1641 
Tyr Cys Gin Asn Asp Tyr Thr 
215 220 

AAG CTC GAA ATC AAA CGG TAA 168 9 

Lys Leu Glu He Lys Arg * 
235 



TAAGCGGCCG CGAATTC 



1706 
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CLAIMS 

1. A multivalent antigen binding protein comprising: 
a first polypeptide comprising, in series, 
three or more variable domains of an antibody 
heavy chain; and 

a second polypeptide comprising, in series, 
three or more variable domains of an antibody 
light chain, 

said first and second polypeptides being linked by 
association of the respective heavy chain and light 
chain variable domains, each associated variable 
domain pair forming an antigen binding site. 

2 . A protein according to Claim 1 comprising a 
trivalent antigen binding protein. 

3 . A protein according to Claim 1 or Claim 2 wherein 
the variable domains of the antibody heavy chain of 
said first polypeptide are linked by a peptide 
linker and the variable domains of the antibody 
light chain of said second polypeptide are linked by 
a peptide linker. 

4 . A protein according to any one of Claims l to 3 
wherein the associated variable domain pair binding 
sites are able to bind different epitopes from each 
other. 

5 . A protein according to any one of Claims 1 to 3 
wherein the associated variable domain pair binding 
sites are able to bind the same epitope as each 
other. 

6 . Nucleotide sequences coding for the polypeptides of 
the multivalent antigen binding protein of any one 
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of the preceding claims. 

7 . Nucleotide sequences according to Claim 6 contained 
in one or more expression vectors. 

8. A host cell transformed with a vector according to 
Claim 7, and capable of expression of the nucleotide 
sequences to produce the polypeptides of the 
multivalent antigen binding protein. 

9 . A host cell according to Claim 8 wherein the 
polypeptides on expression associate to form the 
multivalent antigen binding protein. 

10. A process for preparing a multivalent antigen 
binding protein according to any one of Claims 1 to 
5 comprising 

(i) transforming one or more hosts by 
incorporating genes encoding said first and 
second polypeptides; 

(ii) expressing said genes and said host or 
hosts,- and 

(iii) allowing said first and second 
polypeptides to associate to form the protein. 

11. A protein according to any one of Claims 1 to 5 for 
use in medicine. 

12. A diagnostic or therapeutic composition comprising a 
protein according to any one of Claims 1 to 5 . 

13. Use of composition according to Claim 12 in the 
preparation of an agent for use in diagnosis or 
therapy. 



14 . 



A method of diagnosis or therapy comprising 
administering a protein according to any one of 
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Claims 1 to 5 . 



Use of a protein according to any one of Claims 1 to 
5 in an immunoassay method or for purification. 
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Fig.2A. 



AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 



pelB 
leader 



AAAGLLLLAAQPAMA Q y Q - £ , 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCISCAS 

GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 

rF~g- ■ T-- y; - s, -s ■;- y : A:-ii-; ;:vS w V P. 0 T ^L j2_i£_csS3 

GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCAGACCTCAGACAAGAGTCTG 



:;^:j:'.A : :::.^ili : ; :s:: 



pelB 
leader 



GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGACAATGTGAAG 

F g . :,r :. ~ v f~ : 'p y ; j .: :: : :ig p - r» w <T~k « : ; > ■ . l q h" s si 

GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCT3~Y CC1 1 ; rAAATGAGCAGT 



_jl _±_. yj i_j£_ _j:__a_; 



CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 

LTZ *_ ^ ..~^lA~"Q~~g~" : r "v " t T- "s""~<?; i • ~i] 

TATTTTGACTACTGGGGCCAAGGGACCACGGTCACCGTCTCCTCAGGTGGAGGCGGTTCA 

|gg ggsggggsag" s~a il Q " ^ C CT] 

GGCGGAGGTGGCTCTGGCGGTGGCGGATCgCCCCgTTCGGf:CCAGGTCCAG--T-3r/AC\G 



TCAGGACCTGAGCTGGTAAAGCCTGGGGCTTCAGTGAAGATGTCCTGC^~?30TT r CT3G^ 



T Xl'T 71 ? g" Y M ?' ~W V ■ ' K ■ :Q ~¥ £ C Q fi "" "V" E 

TACACATTCACTAGCTATGTTATGCACTGGGTGAAACAGAAGCCTGGGCAGGGCCTTGAG 

j W:. I- G Y.. I ,- ; Yy-p ; Y a p; J ~tTT JLJZ T." F K G j 
TGGATTGGATATATTTATCCTTACAATGATGGTACTAAGTACAATGAGAAGTTCAAAGGr" 



T vj3 D K 



j> t r £v ... ? » h _ f, s " 



AAGGCCACACTGACTTCAGACAAATCCTCCAGCACAGCCTACATGGAGCTCAGCAGCCTG 

qziiiiiii^^ : q g i 

ACCTCTGAGGACTCTGCGGTCTATTACTGTTCAAGACGCTTTGACTACTGGGGCCAAGGG 
I T T V T V S Sj 

ACCACCGTCACCGTCTCCTCATAATAASCIAG^GGAGCTGCATGCAAATTCTATTTCAAG 

MKYLLPTAAAGLLLLA 
GAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCT 

M A kS ; : ; I 



£ L T Q S P 3 S~W 



x jr ; 



GCCCAACCAGCGATGGCCGACATCGAGCTCACCCAGTCTCCATCTTCCATGTATGCATCT 

j L G E R I T I T C K A G Q D I N T Y L T ! 
CTAGGAGAGAGAATCACTATCACTTGCAAGGCGAGTCAGGACATTAATACCTATTTAACc" 

>M F Q 0 K P O K S P F. TLX Y R A » R L 1 
TGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCAAACAGATTG 
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Fig.2B. 

CTAGATGGGGTCCCATCAAGGTTCAGTGGCAGTGGATCTGGGCAAGATTATTCTCTCACC 

[i:iziii.:^.;^ h — 

ATCAGCAGCCTGGACTATGAAGATATGGGAATTTATTATTGTCTACAATATGATGAGTTG 
TACACGTTCGGAGGGGGGACCAAGCTCGAGATCAAACGGGGTGGAGGCGGTTCAGGCGGA 

Linker 

[GGSGGGGVI LIOZIZIIXIIIIZZI^^^ T1 
GGTGGCTCTGGCGGTGGCGGAGTCGACATCGAACTCACTCAGT'C7?CATTCr^cTfGACl 

ci3io:xx:o:^^ l « - s ,i 

gtgacagcaggagagaaggtcactatgaattgcaagtccggtcagaI^ctgttTIa/^^^t 

GTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGG, } C r U'Aa7\FtO 

VL471 5 [J^TZOO-JZIZ^ 

TTGATCTACTGGGCATCCACTAGGGAA- Jlf 3AGT CCTGATCGCTTCACAGCCAGTGGA 

TCTGGAACAGATTTCACTCTCACCATCAGCAGTGTr CAGGCTG AAC AO rG GC * G^TTaY 

TACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGG 1 3C GA( CAAGCTCGAAATC 
Hydro-It | K R | ( gSGSGNSGKGYL ~k ) 

ta 9 AAACGGGGATCCGGTAGCGGGAACTCCGGTAAGGGGTACCTGAAGTAATAA GCGGCCGC G 
AATTC 
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Fig.3A. 

M K Y L L P T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

pelB aaagllllaaqpamaI 
leader gcagccgctggattgttattactcgctgcccaaccagcgatggccc 



GAGTCAGGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCA 



GGGTTCTCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTG 



CI . V . - . >' > « : 

G; ITGGCTGi GAATGATTTGG 



;gtgatggaaacacagactataat' 



VHlys | r Xi s i ' 



S K § 0 V 



agactgagcatcagcaaggacaactccaagagccaagttttcttaaaaatgaacagtctg 



I h t g si t a a -y y c ■ 



— . a__ji._s_r__d l y_r_ 

cacactgatgacacagccaggtactactgtgccagagagagagattataggi 



VLlys 



tggggcgaaggcaccacgj 



7 T V T V 



s g g g G I 



icgtctcctcaggtggaggcggttcaggcggaggtggc 



nrx> i jl s t q s v ~a s m s'~ s 

TCTGGCGGTGGCGGATCGGACAU ,.TC ^ n\rr 1 S^CCTC C t j ' ff tctgcgtct 



GT<JG(i;AGAA7iL'fGTCAC(JATCAcATC 



tggtatcagcagaaacagggaaaatctcctcagctcctggtctattatacaacaacct 



L£_±l. Q y p s, » t? s a s g s s jr y I I PIP] 

gcagatggtgtgccatcaaggttcagtggcagtggatcagga 1 c ac ; TVPvrc7c0cf 



[ 2 v_> "r a p h r g 3 F""¥""T~v": 

atcaacagcctgcaacctgaagattttggga sttattactgt \ ^™7^t?7V:T^ 



Myc-tag 



. . _ ~g) 

cctcggacgttcggtggaggcaccaagcjscg^qatcaaacgggaacaaaaactcatctca 



_±: MM F G__G_ G T K L 



EJ.'c Q K L 



gaagaggatctgaattaataagatcaaacggtaataaggatccagctcgaattc 
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Fig.3B. 

W MKYLLPT 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

pelB AAAGLLLLAAQPAMA Iq V, £ -St^-Qi I 

leader gcagccgctggattgttattactcgctgcccaaccagcgatggcccaog^cagSS 

gagtcagggggagacttagtgaagcctggagggtccctgacactctcctgtgcaacctct 

!IM<s . "V_ 



L E _ * ... £ a !L. A__s_JLJL^^J.„JL. JL_jL '/ "o" n 

gagtgggtcgcaaccatcagtagtactgatacttatacctattai tcagac \A~T~iroA £g 



VH4715 



Lg _ 'Ml j _3 _ * s. p. b u c k WMM lLjtTJni€iT'~$ ! 

GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGC'A.BATn/voCAG 7 :' 

' _ ^ q ^~ ^'^ ^"<"""a ri hIg1y;3~-7. 

CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 

f x eijOOi _ s_^IIII£IZ3^13£rv~" iZi] ' g g g g s I 

TATTTTGACTACTGGGGCCAAGGGACCACGJ2ICACCGTCTCCTCAGGTGGAGGCGGTTCA 
GGCGGAGGTGGCTCTGGCGGTGGCGGATCGGACATCGGl^CjiAGT^ 

I ] CI- ~"Zl?ITIX~irT IHI III >" o " k 5" fj" ~~s ij — i7" • 

CTGACTGTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCCGGTCAGAGTCTGTTA 

\~*t-:s v n q p. m^g^-^ 

AACAGTGTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGGCAGCCTCCT 

VL47 1 5 A ^ CTGTTGATCTAOTGGGCATCCAC ™ 

■" G G 5 Q' ^ 

AGTGGATCTGGAACAGATTTCACTCTCACCATCAGCAGT' GG ■ ?AGGCTGAAGA^CTG^CA 
GTTTATTACTGTCAGAATGATTATACTTATCCGTTCACGTGG : G AGGGG~ G^GAA?;cI: 

Hydro2-tag ; g I K R j 1 SsgsgnsckgylID 

GASATCAAACGGGGATCCGGTAGCGGGAACTCCGGTAAGGGGTACCTGAAGTAATAAGAT 
CAAACGGTAATAAGGATCCAGCTC GAATTC 
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Fig.4. 

EXON 1 



M G W S C I I L- F L V A T A T 

r^j^vggGAT^GA^CTGy-AT C-^TC C TCTTCTTGGT ?lGCAACAGCTACA^g t 



aaggggctcacagtagcaggcttgaggtctggacatatatatgggtgaca 



3 V H S b V Q 
atgacatccactttgcctttctctccacagOT3TCCACTCCdAGGTCCAA 



L Q 
CTGCAG 



EXON 2 



Fig.5. 



Variable region genes 



<J5'UTR 



N 8000 



SV40 L promoter 
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Fig.6A. 



PstI BstEH Sfil Nhel Sacl Xhol Sail BamHI 
—I I I I I II H 



EcoRI 



pel ^mmm£ ^ ^^i^m P elB v L - B IH ) inker vl-a V 



PCR.51 / DBL.1 DBL.2 PCR.90 / DBL.3 DBL 4 

+ PCR.89 + + PCR.116 + 

DBL.1 DBL. 5 DBL.3 DBL. 6 




J 



HindlD 

PstI BstEflSfil BstEE Sfil 

_l U L_L_ 



Fig.6B. 



Nhel Sacl Xhol Sail Xhol Sail BamHI 



linker linker linker linker Hll-tan 

pe lB VH-A / VH-C I VH-B pelBVL^ [ VL-C | VL-A | 9 



— © — 

Col E1 

Ori 



Amp R 
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Fig.7A. 

pelB ^^S^ Iirike r VH-^B pelB VL-C linke r VL-B Hll-tag 



DBL.1 DBL.5 



DBL.3 DBL.6 

r -[ 

Sail Sail 



Amplification 
Reaction 



Amplification 
Reaction 



VH-C linker 



I I il 



VL-C linker 



Digest with Sfil 



Digest with Sail 



Sfil 



VH-C linker 



nz 



■=1 



VL-C linker 
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Fig.7B. 

pel B 'VH-A linke r | VH-B pel B VL-B ; li nke r VL-A H ll-tag 




Sfil 

1 . Open vector with Sfil f 

2. Insert Sfil /Sfil VH-C-linker 
PCR fragment 



llBit 



VH-C linker 



pelB 

Sail 




Sail Sail 



1 . Open vector with Sail 



1 2. Insert Sail / Sail VL-C-linker 
t 1 1 1J=3 PCR fragment 



LacZ^'fSrcSra pel B VL-b'^^Vl-C '""f ^VL-A Hll-t 




ColE1 
Oh 



AmpR 
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Fig.8A. 



Xbal Pstl/Nhel/Xbal 



Ncol PstI 
leader sequence 




Col E1# Euka.VH 
Ori 

AmpR 



Insert Ncol / Pstl leader 
fragment into Euka.pUC 




Euka-DoMinker 

Hindlll/Xbal/NcoI/Pstl/SacI/ 
Nhel/Notl/Xbal/EcoRl , ' 



Euka.pUC 



AmpR 



AmpR 



N col Sa cl 

leader sequence 



Insert Ncol / SacI leader 
fragment into Euka.pUC 
(PCR fragment ML.1/ML.2) 



Xbal Sacl/Notl/Xbal 



Col E1f 
Ori 



AmpR 
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Xbal Pstl/Nhel/Xbal 



Col ElT 
Ori # Euka.VH 



Fig.8B. 




PstI 



AmpR 



VH-A linker VH-B 



Insert VH2 or VH3 Pstl/Nhel 
fragments from pUC constructs into 
PstI / Nhel opened vector 



Nhel 

VH-A linker VH-B linker VH-C 




Excise Xbal /Xbal fragments 
from Euka constructs and insert into 
Xbal opened pSV.51 vector 



pSV.B/pSV.D/pSV.G/pSV.K/pSV.M 
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Fig.8C. 

Xbal Sacl/Notl/Xbal 




VL-A linker VL-B 



Insert VL2-HII or VL3-HII 
EcoRl-Klenow / Sad fragments 
from pUC constructs into 

Noti-Klenow / Sad vector 0 , _ _,„,, 

EcoRI/Klenow 



VL-A / VL-B / VL-C 
linker linker 




Excise Xbal /Xbal fragments 
from Euka constructs and insert into 
Xbal opened pSV.51 vector 



p S V. N/pS V. P/pS V. S/pS V. V/pS V. W 
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< 

I I 

<c to 

Q Q 

JSC 

o o 
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B 




■ LYSOX 

□ gosa 

H LYSAN 
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Fig.12. 



GAATTCX3GCCGACATC£fl££2ZCACCCAGTCTCCAGCCTCCCTTTCTGCGTCTGTGGGAGA 

EoriEiizztO^^^ liiiii] 

AACTGTCACCATCACATGTCGAGCAAGTGGGAATATTCACAATTATTTAGCATGGTATCA 

r"Q"K Q .G K. S P O -L .^..y. Y v ..y :: >T,..?„ L A~ :.<? 
GCAGAAACAGGGAAAATCTCCTCAGCTCCTGGTCTATTATACAACAACCTTAGCAGATGG 

TGTGCCATCAAGGTTCAGTGGCAGTGGATCAGGAACACAATATTCTCTCAAGATCAACAG 

! l :Q ,p edf n ~j "/^/TIT"?.. - y -5- t r" "n~Y_T/ 

CCTGCAACCTGAAGATTTTGGGAGTTATTACTGTCAACATTTTTGGAGTACTCCTCGGAC 

[ G: : G G :T K L E I ■ K Tr \ |GGGGSGGG G sj 
GTTCGGTGGAACCAAGCTQSAGATCAAACGGGGTGGAGGCGGTTCAGGCGGAGGTGGCTC 

Linker 

1 G G G G S |0/V-Q:,;:L,:.QV-E 0- ? -3 ^V>A » ■ 1 

TGGCGGTGGCGGATCGCAGGTGCAGCZSC&GGAGTCAGGACCTGGCCTGGTGGCGCCCTC 

ACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTCTCATTAACCGGCTATGGTGTAAA 

T 1 :AGCCTCC/ \AAGGGT jGAGT GC1 GGAAT VTT ; GTGATG 

VHLyS AAACACAGACTATAATTCAGCTCTCAAATCCAGACTGAGCATCAGCAAGGACAACTCCAA 

GAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACTGATGACACAGCCAGGTACTACTG 

TGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGCCAAGGGACCACGGTCACCGTCTC 

CTCATGAT AAGCTT 
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w MKYLLPTA 
pelB AAGCTTGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAG 

leader 



GLLLLAAQPAMA j Q y Q r, q q "j 
CCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCJZ2CA5CAGT 

E" GPK .LVKPG-ASV T ^^TITF * "liTj.S - -Q 
CAGGACCTGAGCTGGTAAAGCCTGGGGCTTCAGTGAAGATGTCCTGCAAGGCTTCTGGAT 

F V "t'-^s y~v~~m h »w 'T ~7rTT^^^^^£D 

ACACATTCACTAGCTATGTTATGCACTGGGTGAAACAGAAGCCTGGGCAGGGCCTTGAGT 



VH3418 



|W I G Y T Y y Y m . P" G T K : *T F™T"'"'0 " "T™ i 
GGATTGGATATATTTATCCTTACAATGATGGTACTAAGTACAATGAGAAGTTCAAAGGCA 

[k a t i "<r s k $ $ s y YL VjC s ' s It j 

AGGCCACACTGACTTCAGACAAATCCTCCAGCACAGCCTACATGGAGCTCAGCAGCCTGA 

|T -Si EDS A V Y~~ Y C H K H ■ F: jD y W G__(T^~J. 
CCTCTGAGGACTCTGCGGTCTATTACTGTTCAAGACGCTTTGACTACTGGGGCCAAGGGA 

It t v y . v. s ij 

CCACGGTCACCGTCTCCTCATAATAAGAGCTATGGGAGCTTGCATGCAAATTCTATTTCA 



AGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCG 

leader a a q p a m a [p j g l~T"~q s " p ~s~"~s~k — T~1C~~) 

CTCCCCAACCAGCGATGGCCnACATCGJiGCrCACCrAGTCTCrATC'ITCCATGTATGCAT 
CTCTAGGAGAGAGAATCACTATCACTTGCAA ;G GAGTCAGC <\CAT 4 J>"/ i7t*IT V. ' 



pelB 



[t . w y v * & g y 1 T""~k~t"~ 



CCTGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCAAACAGAT 

VL3418 Ll~OZjOOZ5I]l^^ 

tgctagatggggtcccatcaaggttcagtg'o'' j g ^atctc^gc \a at, ; it ^~t~?r.. 
ccatcagcagcctggactatgaagatatgggaatttattattgtctacaata^atg'^c^ J 

|X;:;;,Y K\ *: fr-,«: H fr** ■ X : ;--K--;-^'l 

tgtacacgttcggaggggggaccaagcjicgagatcaaacggtaataatgatcaaacggt 
ataaggatccagctcgaattc 
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pelB AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

leader 

AAAGLLLLAAQPAMA [qZJ .£ jQp 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAG CTGCAG 

GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 

I G F T F S S Y A Fi S"W V >R O,^.. S J3 <%J1lg 

GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCAGACCTCAGACAAGAGTCTG 

V H47 1 5 GAGTGGGTCGCAACCATCAGTAGTACTGATAOT ^ 

fp r"f t""T £ r i> ">r"G it U~'Y~~L '^'~L~Q~l*"ir"'sj 
GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 

CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 
FT F D Y W G O G T T V T V : Spsl 

TATTTTGACTACTGGGGCCAAGGGACCACGGICATCGTCTCCTCATAATAAGAGCTATGG 

M K Y L L P T 
pe IB GAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

leader 

AAAGLLLLAAQPAM _^ 

GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATC GAGCTC ArT'' 



CAGTCTCCATTCTCCCTGACTGTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCC 



CCAGGGCAGCCTCCTAAACTGTTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCT 

VL4715 _ 



.F, T A, S 0 g G T P ~F~T~T, 7 T 9 g vjj"] 
GATCGCTTCACAGCCAGTGGATCTGGAACAGATTTCACTCTCACCATCA GCP GTGTGC A( 

GCTGAAGACCTGGCAGTTTATTACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGA 

Myc-tag CiL jj„jOL_k"_? OEZFI ( £Qkliseedl ID 

GGGGGGACCAAGC_XCGA£ATCAAACGGGAACAAAAACTCATCTCAGAAGAGGATCTGAAT 
TAATAAGATCAAACGGTAATAAGGATCCAGCTCGAATTC 
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■ p. M K Y L L P T 

f JB ' D AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 
l6QQ6r 

AAAGLLLLAAQP AMA |Q V Q I.'""^ 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGC1SC&G 

l"s S~T G 5 L V "K P G € S 1, "t " L, :S C A ' 7 y~~*T 
GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 

R F ,T F S.-S V A F S W V P. O T j D : K- ■ : t7] 

GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCAGACCTCAGACAAGAGTCTG 

1 £. w v a " y i s s y . 0 t y : y- y y s d y vTi 

VH47 1 5 GAGTGGGTCGC ^ CCATCAGTAGTACT G ATAC,rTAT ACCTATTATTCAGACAATGTGAAG 

I G K F T I S R D K G K K T I* Y I, Q M S si 

GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 

["L K SBPT&VYYCAKHGYY G ~T T G ] 

CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 

!Y F D Y W G 0 G T : T V T V S fil |_G_ G G C~"~S ] 

Linker tattttgactactggggccaagggaccac££ica£cgtctcctcaggtggaggcggttca 

|G G G G ~S G G G G ■" I [ITT g i, T C S p F"^~1 
GGCGGAGGTGGCTCTGGCGGTGGCGGATCGGACATC£^G£ICACTCAGTCTCCATTCTCC 

r T~"T~v""r A G S ::iK.:UV. T K W C K : 3-; 3 Q S l7~T~i 

ctgactgtgacagcaggagagaaggtcactatgaattgcaagtccgct^ 

f¥sVMQ -P MYL:TWY "g~~~Q K P G Q : p g'""| 
AACAGTGTAAATCAGAGC-AACTACTTGACCTGGTACCAGCAGAAGCCAGGGrAGCCTCCT 

VI 471 5 

" LXILZOC— ~* s- y-iT s S G V P B R f . y a" I 

AAACTGTTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCTGATCGCTTCACAGCC 

^SZG^S_±_^^T~¥~ 1 T I S S V Q A- £~D^L~ TT| 
AGTGGATCTGGAACAGATTyCACTCTCACCATCAGCAGTGTGCAGGCTGAAGACCTGGCA 

IVY Y.C 0 K D Y T Y ' P F T F -G G . G T -K L ~| 
GTTTATTACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGGGGGGACCAAGCTC 
MyC'tag jXITU CE Q K L I 5 E E D~1~^ 

GAUATCAAACGGGAAGAAAAACTCATCTCAGAAGAGGATCTGAATTAATAAGATCAAACG 
GTAATAAGGATCCAGCTCGAATTC 



SUBSTITUTE SHEET (RULE 26) 



%5 



PCT/EP97/01609 



Fig.16A. 



AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 



pelB AAAGLLLLAAQPAM 

leader gcagccgctggattgttattactcgctgcccaaccagcgatggcccaggtgca gc^^ 



□ 



c <5 n i. v k ? <; g T , y Tj _ c 



gagtcagggggagacttagtgaagcctggagggtccctgacactctcctgtgcaacctct 



5 Y A F S W V 



GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCAGACCTCAGACAAGAGTCTG 
GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGACA/ TGTGAaL 
GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 
CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 

iS ^MZML £ XT EMMMEMM I g g g G ~n 

TATTTTGACTACTGGGGCCAAGGGACCACGGTCACiTGTCTCCTCAGGTGGAGGCGGTTCA 

jGGGGSGGGGSAG S^ [q~'"T"q"~Q "] 
GGCGGAGGTGGCTCTGGCGGTGGCGGATCGi]££GjmX^GC£CAGGTCCAGCTGCAACAG 



TACACATTCACTAGCTATGTTATGCACTGGGTGAAACAGAAGCCTGGGCAGGGCCTTGAG 



I w .j. 3 i _ i i mm _ u y K _ Y f{ E . K F R_ G ; 

VH341 8 TGGATTGGATATATTTATCC ^ ACAATG ^^ 

AAGGCCAC Tgp I = -v.. A^T" 7 """/ ~ ~'rS~T\7^ : ^T^^^~ S ^^ 
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rGAG ACTCTGCt rCTATTACTGTTCAAGACGCTTTGACTACTGGGGCCAAGGG 

:ggagctgcatgcaaattctatttcaag 



pelB 
leader 



VL3418 
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ACCACCGTCACCGTCTCCTCATAATAA GCTAGC ^ 



M K Y L L 



GAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCT 

A Q P A M A [^71Z^ZlZjiZSZ^~^ S~~1T^H^T~X"'1*"*~1 
GCCCAACCAGCGATGGCCGACATCGAGCTCACCCAGTCTCCATCT^CCATG^fGX^ 
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CTAGGAGAGAGAATCACTATCACT7GC/iA::C;.:GAC;TC-AGGACATTAATACCTATTTAA';C 
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TGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCAAACAGATTG 
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CTAGATGGGGTCCCATCAAGGTTCAGTGGCAGTGGATCTGGGCAAGATTATTCTCTCACC 
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ATCAGCAGCCTGGACTATGAAGATATGGGAATTTATTATTGTCTACAATATGATGAGTTG 

[> _'TT QO» T-TTt^ 3~ $,"~^~~T G G G G S G Gi 

TACACGTTCGGAGGGGGGACCAAGCTCGAGATCAAACGGGGTGGAGGCGGTTCAGGCGGA 

Linker 

|ggsggggv| ip_j. ... s Z ■■f q s "IT '"f " ~FTir~T ~l 

GGTGGCTCTGGCGGTGGCGGAGTCGACATCGAACTCACTCAGTr I C AT^CTCCCTGACT 

LZZOOO^ o L EiBs HI 

GTGACAGCAGGAGAGAAGGTCACTAT:;AATTGC^X;TCCGCTf:A~AGTCT^fT.^ACAG^' 
GTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGGCAGCCTCCTAAACTG 

VL4715 Goziii^iri^^ 

TTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCTGATCGCTTCACAGCCAGTGGA 

U -g/ F X 8 S V-Q A S D 1. A~~V rT Y~l 

TCTGGAACAGATTTCACTCTCACCATCAGCAGTGTGCAGGCTGAAGACCTGGCAGTTTAT 

TACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGGGGGGACCAAGCTCGAAATC 



AAACGGTAATAA GCGGCCGC GAATTC 
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Amp R 



ligate BstEU/SacI 
(Gly 4 Ser) 3 linker 
into vector 
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PstI BstEfl bad Xhol 
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PCR-I BstED /Sad 




AmpR 



SUBSTITUTE SHEET (RULE 26) 



PCT7EP97/01609 



Fig.18. 




Amp M 



remove 
Xhol / EcoRl 
fragment 



HindlD EcpRI 
PstI BstEU Sad Xhol 
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(Gly 4 Ser)3 linker 
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Pstl BstED Sfil Nhel 



pelB 
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» linker 
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PCR.V Sail /EcoRI 
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reaction 
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EcoRI 
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6 PCR.89 

7 PCR.90 

8 PCR.116 



10 DBL.2 

11 DBL.3 

12 DBL.4 

13 DBL.5 

14 DBL . 6 

15 Euka.l 

16 Euka.2 

17 ML.l 

18 ML. 2 

19 DBL. 7 

20 DBL. 8 

21 DBL. 9 

22 DBL. 10 : 5' 



' AGG T(C/G) (A/C) A(C/A)C TGC AG(C/G) AGT C(A/T)G G 
3' 

■ TGA GGA GAC GGT GAC CGT GGT CCC TTG GCC CC 3 ' 
' GAC ATT GAG CTC ACC CAG TCT CCA 3' 
' GTT AGA TCT CGA GCT TGG TCC C 3 ' 

' CAG GAT CCG GCC GGT TCG GCC CAG GTC CAG CTC CAA 

CAG TCA GGA '3 
1 CTA CAT GAA TTC GCT AGC TTA TTA TGA GGA GAC GGT 

GAC GGT GGT CCC TTG GC '3 
1 ATT GGA GTC GAC ATC GAA CTC ACT CAG TCT CCA TTC 

TCC 3 1 

CGA ATT CGG ATC CCC GTT TGA TTT CGA GCT TGG TCC 1 3 
GAG CGC GAG CTC GGC CGA ACC GGC CGA TCC GCC ACC 
GCC AGA GCC '3 

AAT GTC GAA TTC GTC GAC TCC GCC ACC GCC AGA GCC ' 3 
AGC TTC TAG ACC ACC ATG GAA AAC TGC AGA GCT CAA 
AAG CTA GCG CGG CGG CTC TAG '3 

AAT TCT AGA GCG GCC GCG CTA GCT TTT GAG CTC TGC 
AGT TTT CCA TGG TGG TCT AGA '3 

ACG GGT GAG CTC GAT GTC GGA GTG GAC ACC TGT GGA 
GAG A '3 

GGA AAC AGC TAT GAC CAT GAT TAC '3 
CAC CAT CTC CAG AGA CAA TGG CAA G 3' 
ACC AAG CTC GAG ATC AAA CGG GG 3 ' 

TGA AGT GAA TTC GCG GCC GCT TAT TAC CGT TTG ATT 
TCG AGC TTG GTC CC 3- 

TAA TAA GCT AGC GGA GCT GCA TGG AAA TTC TAT TTC 3 ' 

Table 1 



SUBSTITUTE SHEET (RULE 26) 



PCT/EP97/01609 



3 %5 



Code Expressed Antibody fragment 



pSVB 


VH4715-VH3418 


pSVD 


VH4715-VHIys 


pSVG 


VH3418-VHIys 


pSVK 


VH4715-VHIys-VH3418 


pSVM 


VHIys-VH4715-VH3418 


pSVN 


VL3418-VL4715.2t 


pSVP . 


VLIys-VL4715.2t 


pSVS : 


VLIys-VL3418.2t 


pSVV : 


VLIys-VL4715-VL3418.2t 


pSVW : 


VL3418-VLIys-VL4715.2t 



Table 2 



SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT ,ntoMUt ^p"" 00 " n ° 

PCT/EP 97/016G9 



A. CLASSIFICATION OF SUBJECT MATTER , , „ 

IPC 6 C12N15/13 C07K16/46 C07K16/0O C12N5/10 A61K39/395 
G01N33/577 

According to International Patent Claaificaaon (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Documentation searched other th 



Electronic data base consulted during the international se 



:h (name of data base and, w 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



WO 93 11161 A (ENZON, INC.) 10 June 1993 

cited in the application 

see page 22, line 1 - line 10 

see claims 

WO 94 09131 A (SCOTGEN LTD.) 28 April 1994 
cited in the application 
see claims 
see figures 

-/-- 



LH ' 




le or theory underlying the 



nt of particular relevance; the claimed in 



document member of the same patent family 



Date of the actual completion of the in 



14 August 1997 

md mailing address of the ISA 

European Patent Office, P.B. 5818 Patrntlaan 2 
NL ■ 2280 HV Rijswijk 
Tel. ( + 31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: ( + 31-70) 340-3016 



Date of mailing of the 

10.09.97 



Nooij, F 



page 1 of 2 



2 



INTERNATIONAL SEARCH REPORT 


PCT/EP 97/01609 


C.(Continua 


aon) DOCUMENTS CONSIDERED TO BE RELEVANT 












A 


PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF THE USA, 

vol. 90, no. 14, 15 July 1993, WASHINGTON, 
DC, USA, 

pages 6444-6448, XP002014058 
P. HOLLIGER ET AL. : ""Diabodies" : Small 
bivalent and bi specific antibody 
fragments. " 

see the whole document 




1-15 


A 


WO 94 13806 A (THE DOW CHEMICAL COMPANY) 
23 June 1994 
see figure 1 




1-15 


A 


WO 94 13804 A (CAMBRIDGE ANTIBODY 
TECHNOLOGY LTD. ET AL.) 23 June 1994 
see page 31, line 10 - line 12 
see figure 1 




1-15 


T 


WO 97 14719 A (UNILEVER) 24 April 1997 
see the whole document 




1-15 



Form PCT'ISA/310 (continuation of Mcono UMel) (July 1991) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



lnti .ionsil application No 

PCT/EP 97/01609 



Observations where certain claii 



Ie (Continuation of item 1 of first sheet) 



his International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

. |X| Claims Nos.: 14 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claim(s) 14 

1s(are) directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged 
effects of the compound/composition. 



■ Claims Nos.: 

because they relate to parts of the international Application that Qt 
an extent that no meaningful International Search can be carried 01 



^ be«™ *h°e S y are dependent claims and are not drafted in accordance w.th the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This Internauonal Searching Authority found multiple inventions in this international applicauon, as follows: 



□ As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 

2. Q As all searchable claims could be searched without effort jusufymg an additional fee, this Authority did not invite payment 

of any additional fee. 

3. I I As only some of the required additional search fees were timely paid by the applicant, this Internauonal Search Report 
1 1 covers only those claims for which fees were paid, specifically claims Nos.: 



| | The additional search fees were accompanied by the applicant s protest. 
| | No protest accompanied the payment of additional search fees. 



Form PCT/IS A/210 (continuation of first sheet (I)) (July 1992) 



INTERNATIONAL SEARCH REPORT 

Inhumation on patent family members 



PCT/EP 97/01609 



•(*) 



WO 9311161 A 



WO 9409131 A 



WO 9413806 A 23-06-94 



W0 9413804 A 23-06-94 



W0 9714719 A 24-04-97 



AU 3178993 A 

CA 2122732 A 

EP 0617706 A 

JP 7501451 T 



AU 5283793 A 

CA 2146854 A 

GB 2286189 A 

JP 8505761 T 



AU 5747794 A 

CA 2117477 A 

EP 0628078 A 

JP 7503622 T 



5654894 A 
2150262 A 
0672142 A 
8504100 T 
7621494 A 
2169620 A 
0720624 A 
9508577 A 



AU 6873396 A 



28-06-93 
10-06-93 
05-10-94 
16-02-95 



09-05-94 
28-04-94 
09-08-95 
25-06-96 



04-07-94 
23-06-94 
14-12-94 
20-04-95 



04-07-94 
23-06-94 
20-09-95 
07-05-96 
10-04-95 
30-03-95 
10-07-96 
30-03-95 



07-05-97 



Form PCT/ISA/210 (f»Mnt fimily mibm) (July 



